Image encoding/decoding method and device, and recording medium storing bit stream

ABSTRACT

An image encoding/decoding method and apparatus are provided. An image encoding method performed by an image encoding apparatus may comprise determining an intra-prediction mode of a current block, configuring a reference sample of intra prediction by using at least one of a neighbor pixel of the current block in a current picture and a pixel of a reference picture, and performing intra prediction for the current block based on the intra-prediction mode and the reference sample.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a U.S. National Stage Application of International Application No. PCT/KR2017/013377, filed on Nov. 22, 2017, which claims the benefit under 35 USC 119(a) and 365(b) of Korean Patent Application No. 10-2016-0155814, filed on Nov. 22, 2016 and Korean Patent Application No. 10-2016-0155815, filed on Nov. 22, 2016 in the Korean Intellectual Property Office, the entire disclosures of which are incorporated herein by reference for all purposes.

TECHNICAL FIELD

The present invention relates to a method and apparatus for encoding/decoding an image and a recording medium storing a bitstream. Particularly, the present invention relates to a method and apparatus for encoding/decoding an image performing intra prediction using a sample of a reference picture, and a recording medium storing a bitstream generated by an image encoding method/apparatus of the present invention.

BACKGROUND ART

Recently, demands for high-resolution and high-quality images, such as high definition (HD) images and ultra high definition (UHD) images, have increased in various application fields. However, higher resolution and quality image data has increasing amounts of data in comparison with conventional image data. Therefore, when transmitting image data by using a medium such as conventional wired and wireless broadband networks, or when storing image data by using a conventional storage medium, costs of transmission and storage increase. In order to solve these problems occurring with an increase in resolution and quality of image data, high-efficiency image encoding/decoding techniques are required for higher-resolution and higher-quality images.

Image compression technology includes various techniques, including: an inter-prediction technique of predicting a pixel value included in a current picture from a previous or subsequent picture of the current picture; an intra-prediction technique of predicting a pixel value included in a current picture by using pixel information in the current picture; a transform and quantization technique for compressing the energy of a residual signal; an entropy encoding technique of assigning a short code to a value with a high appearance frequency and assigning a long code to a value with a low appearance frequency; etc. Image data may be effectively compressed by using such image compression technology, and may be transmitted or stored.

DISCLOSURE

Technical Problem

An object of the present invention is to provide a method and apparatus for encoding and decoding an image to enhance compression efficiency.

Another object of the present invention is to provide a method and apparatus for encoding and decoding an image performing intra prediction using a sample of a reference picture.

Another object of the present invention is to provide a recording medium storing a bitstream generated by an image encoding method/apparatus of the present invention.

Technical Solution

An image encoding method performed by an image encoding apparatus according to the present invention may comprise determining an intra-prediction mode of a current block, configuring a reference sample of intra prediction by using at least one of a neighbor pixel of the current block in a current picture and a pixel of a reference picture, and performing intra prediction for the current block based on the intra-prediction mode and the reference sample.
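Purely as an illustration of these three steps, a minimal sketch follows. The function names (configure_reference_samples, dc_predict), the None-based availability marking, and the fallback rule are assumptions of this sketch, not the claimed method itself; a DC-like mode is assumed for step 3, and the mode decision of step 1 is abstracted away.

```python
def configure_reference_samples(neighbors, ref_pic_pixels):
    """Step 2: use current-picture neighbor pixels where available,
    otherwise fall back to reference-picture pixels (an assumed rule)."""
    return [n if n is not None else r for n, r in zip(neighbors, ref_pic_pixels)]

def dc_predict(block_size, ref_samples):
    """Step 3 for a DC-like mode: fill the block with the reference average."""
    avg = sum(ref_samples) // len(ref_samples)
    return [[avg] * block_size for _ in range(block_size)]

# Step 1 (mode determination) is abstracted away; a DC-like mode is assumed.
neighbors = [120, 121, None, None]   # two current-picture neighbors unavailable
ref_pic   = [118, 119, 122, 123]     # corresponding reference-picture pixels
samples   = configure_reference_samples(neighbors, ref_pic)
print(dc_predict(4, samples))        # 4x4 block filled with the average value
```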

In the image encoding method according to the present invention, the pixel of the reference picture may be a pixel included in a similar area in the reference picture of the current block.

In the image encoding method according to the present invention, the similar area in the reference picture may be specified based on similarity calculation using the current block and the reference picture.

In the image encoding method according to the present invention, the similar area in the reference picture may be an area in the reference picture having a same position as the current block.

In the image encoding method according to the present invention, the similar area in the reference picture may be specified based on prediction information of a neighbor block of the current block.

The image encoding method according to the present invention may further comprise encoding information for specifying the similar area in the reference picture.

In the image encoding method according to the present invention, the intra-prediction mode may include a directionality mode using omni-directional prediction, and a same intra-prediction mode number may be assigned to directionality modes in 180-degree opposite directions.

The image encoding method according to the present invention may further comprise encoding information on whether an intra-prediction direction is a direction indicating a right or a bottom of the current block.
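As a toy illustration of this signaling (the 0-360 degree angle parameterization and the flag convention below are assumptions of this sketch, not the specification's), two directions 180 degrees apart share one mode number and are distinguished by a one-bit flag:

```python
def encode_direction(angle_deg):
    mode_number = angle_deg % 180               # 45 and 225 share one number
    opposite_flag = 1 if angle_deg % 360 >= 180 else 0  # assumed convention
    return mode_number, opposite_flag

def decode_direction(mode_number, opposite_flag):
    return mode_number + 180 * opposite_flag    # recover the actual direction

assert decode_direction(*encode_direction(225)) == 225
print(encode_direction(45), encode_direction(225))  # same number, flags 0 / 1
```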

In the image encoding method according to the present invention, the configuring of the reference sample may include, by using the pixel of the reference picture, padding an unavailable reference sample or filtering the configured reference sample.

An image encoding apparatus according to the present invention may comprise an intra-prediction unit which is configured to determine an intra-prediction mode of a current block, configure a reference sample of intra prediction by using at least one of a neighbor pixel of the current block in a current picture and a pixel of a reference picture, and perform intra prediction for the current block based on the intra-prediction mode and the reference sample.

An image decoding method performed by an image decoding apparatus according to the present invention may comprise decoding an intra-prediction mode of a current block, configuring a reference sample of intra prediction by using at least one of a neighbor pixel of the current block in a current picture and a pixel of a reference picture, and performing intra prediction for the current block based on the intra-prediction mode and the reference sample.

In the image decoding method according to the present invention, the pixel of the reference picture may be a pixel included in a similar area in the reference picture of the current block.

In the image decoding method according to the present invention, the similar area in the reference picture may be specified based on similarity calculation using the current block and the reference picture.

In the image decoding method according to the present invention, the similar area in the reference picture may be an area in the reference picture having a same position as the current block.

In the image decoding method according to the present invention, the similar area in the reference picture may be specified based on prediction information of a neighbor block of the current block.

The image decoding method according to the present invention may further comprise decoding information for specifying the similar area in the reference picture.

The image decoding method according to the present invention may further comprise decoding information on whether an intra-prediction direction is a direction indicating a right or a bottom of the current block when the intra-prediction mode is a directionality mode. The intra-prediction mode may include the directionality mode using omni-directional prediction, a same intra-prediction mode number may be assigned to directionality modes in 180-degree opposite directions, and the intra-prediction mode of the current block may be decoded based on the intra-prediction mode number and the information on whether the intra-prediction direction is the direction indicating the right or the bottom of the current block.

In the image decoding method according to the present invention, the configuring of the reference sample may include, by using the pixel of the reference picture, padding an unavailable reference sample or filtering the configured reference sample.

An image decoding apparatus according to the present invention may comprise an intra-prediction unit which is configured to decode an intra-prediction mode of a current block, configure a reference sample of intra prediction by using at least one of a neighbor pixel of the current block in a current picture and a pixel of a reference picture, and perform intra prediction for the current block based on the intra-prediction mode and the reference sample.

A recording medium according to the present invention may store a bitstream generated by an image encoding method according to the present invention.

Advantageous Effects

According to the present invention, a method and apparatus for encoding and decoding an image to enhance compression efficiency may be provided.

According to the present invention, a method and apparatus for encoding and decoding an image performing intra prediction using a sample of a reference picture may be provided.

According to the present invention, a recording medium storing a bitstream generated by an image encoding method/apparatus of the present invention may be provided.

DESCRIPTION OF DRAWINGS

FIG. 1 is a block diagram showing a configuration of an encoding apparatus according to an embodiment of the present invention.

FIG. 2 is a block diagram showing a configuration of a decoding apparatus according to an embodiment of the present invention.

FIG. 3 is a view schematically showing a partition structure of an image when encoding and decoding the image.

FIG. 4 is a view for explaining an embodiment of a process of intra prediction.

FIG. 5 is a diagram illustrating an embodiment of an inter-picture prediction process.

FIG. 6 is a view illustrating the case where similarity between the current block and the unreconstructed area is high.

FIG. 7 is a view illustrating an intra-prediction mode according to the present invention.

FIG. 8 is a view illustrating directionality of an intra-prediction mode for an area having pixel distribution with directionality.

FIG. 9 is a view illustrating a method of configuring a reference pixel for intra prediction of the current block by using the pixel of the current picture and the pixel of the reference picture.

FIG. 10 is a view illustrating operation of an encoder performing intra prediction according to the present invention.

FIG. 11 is a view illustrating operation of a decoder performing intra prediction according to the present invention.

FIG. 12 is a view illustrating Inter_DC prediction according to the present invention.

FIG. 13 is a view illustrating an embodiment of Inter_DC prediction according to the present invention.

FIG. 14 is a view illustrating operation of an encoder performing another embodiment of DC prediction according to the present invention.

FIG. 15 is a view illustrating Inter_Planar prediction according to the present invention.

FIG. 16 is a view illustrating another embodiment of the reference pixel that may be used in Inter_Planar prediction.

FIG. 17 is a view illustrating another embodiment of Inter_Planar prediction according to the present invention.

FIG. 18 is a view illustrating another embodiment of an omni-directional prediction mode according to the present invention.

FIG. 19 is a view illustrating directionality prediction referring to the pixel of the similar area in the reference picture.

FIG. 20 is a view illustrating Inter_Angular prediction according to another embodiment of the present invention.

FIG. 21 is a view illustrating operation of a decoder performing angular prediction according to the present invention.

FIG. 22 is a view illustrating a filtering method using the pixel of the similar area in the reference picture.

FIG. 23 is a view illustrating an embodiment of an encoder performing prediction using the pixel of the similar area in the reference picture.

FIG. 24 is a view illustrating another embodiment of an encoder performing prediction using the pixel of the similar area in the reference picture.

FIG. 25 is a view illustrating an embodiment of a decoder performing prediction using the pixel of the similar area in the reference picture.

FIG. 26 is a view illustrating another embodiment of a decoder performing prediction using the pixel of the similar area in the reference picture.

FIG. 27 is a view illustrating an embodiment of an apparatus for performing Inter_DC prediction in a similar area pixel prediction unit according to the present invention.

FIG. 28 is a view illustrating an embodiment of an apparatus for performing Inter_Planar prediction in a similar area pixel prediction unit according to the present invention.

FIG. 29 is a view illustrating an embodiment of an apparatus for performing Inter_Angular prediction in a similar area pixel prediction unit according to the present invention.

FIG. 30 is a view illustrating reference sample padding.

FIG. 31 is a view illustrating reference sample filtering.

FIG. 32 is a view illustrating a method of performing reference sample padding using the pixel of the reference picture when some reference pixels of intra prediction do not exist.

FIG. 33 is a view illustrating a method of performing reference sample padding using the pixel of the reference picture when all reference pixels of intra prediction do not exist.

FIG. 34 is a view illustrating an embodiment of reference sample padding according to the present invention.

FIG. 35 is a view illustrating reference sample filtering using the pixel of the reference picture.

FIG. 36 is a view illustrating an embodiment of reference sample filtering according to the present invention.

FIG. 37 is a view illustrating an embodiment of an encoder that performs inter reference sample padding and/or inter reference sample filtering according to the present invention.

FIG. 38 is a view illustrating another embodiment of an encoder that performs inter reference sample padding and/or inter reference sample filtering according to the present invention.

FIG. 39 is a view illustrating an embodiment of a decoder that performs inter reference sample padding and/or inter reference sample filtering according to the present invention.

FIG. 40 is a view illustrating another embodiment of a decoder that performs inter reference sample padding and/or inter reference sample filtering according to the present invention.

FIG. 41 is a view illustrating an embodiment of a detailed configuration of the inter reference sample configuration unit according to the present invention.

FIG. 42 is a diagram showing a transfer modification as an embodiment of geometric modification of an image according to the present invention.

FIG. 43 is a diagram showing a size modification as an embodiment of geometric modification of an image according to the present invention.

FIG. 44 is a diagram showing a rotation modification as an embodiment of geometric modification of an image according to the present invention.

FIG. 45 is a diagram showing an affine modification as an embodiment of geometric modification of an image according to the present invention.

FIG. 46 is a diagram showing a projective modification as an embodiment of geometric modification of an image according to the present invention.

FIG. 47 is a diagram showing an example method of implementing a homography according to the present invention.

FIG. 48 is a diagram showing an example method of deriving a relational formula between two corresponding points within two images according to the present invention.

FIG. 49 is a diagram showing a method of generating a geometrically modified image based on a geometric modification matrix and an original image according to the present invention.

FIG. 50 is a diagram showing a method of generating a geometrically modified image by using inverse mapping according to the present invention.

FIG. 51 is a diagram showing a method of generating a geometrically modified image based on a geometric modification matrix and an original image according to the present invention.

FIG. 52 is a diagram showing a bilinear interpolation among various interpolation methods explained with reference to FIG. 51, according to an embodiment of the present invention.

MODE FOR CARRYING OUT THE INVENTION

A variety of modifications may be made to the present invention and there are various embodiments of the present invention, examples of which will now be provided with reference to drawings and described in detail. However, the present invention is not limited thereto, and the exemplary embodiments should be construed as including all modifications, equivalents, or substitutes within the technical concept and technical scope of the present invention. Similar reference numerals refer to the same or similar functions in various aspects. In the drawings, the shapes and dimensions of elements may be exaggerated for clarity. In the following detailed description of the present invention, references are made to the accompanying drawings that show, by way of illustration, specific embodiments in which the invention may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to implement the present disclosure. It should be understood that the various embodiments of the present disclosure, although different, are not necessarily mutually exclusive. For example, specific features, structures, and characteristics described herein in connection with one embodiment may be implemented within other embodiments without departing from the spirit and scope of the present disclosure. In addition, it should be understood that the location or arrangement of individual elements within each disclosed embodiment may be modified without departing from the spirit and scope of the present disclosure. The following detailed description is, therefore, not to be taken in a limiting sense, and the scope of the present disclosure is defined only by the appended claims, appropriately interpreted, along with the full range of equivalents to which the claims are entitled.

Terms used in the specification, ‘first’, ‘second’, etc., can be used to describe various components, but the components are not to be construed as being limited to the terms. The terms are only used to differentiate one component from other components. For example, the ‘first’ component may be named the ‘second’ component without departing from the scope of the present invention, and the ‘second’ component may also be similarly named the ‘first’ component. The term ‘and/or’ includes a combination of a plurality of items or any one of a plurality of terms.

It will be understood that when an element is simply referred to as being ‘connected to’ or ‘coupled to’ another element without being ‘directly connected to’ or ‘directly coupled to’ another element in the present description, it may be ‘directly connected to’ or ‘directly coupled to’ another element, or be connected to or coupled to another element with the other element intervening therebetween. In contrast, it should be understood that when an element is referred to as being “directly coupled” or “directly connected” to another element, there are no intervening elements present.

Furthermore, constitutional parts shown in the embodiments of the present invention are independently shown so as to represent characteristic functions different from each other. Thus, it does not mean that each constitutional part is constituted in a constitutional unit of separated hardware or software. In other words, each constitutional part is enumerated as a respective constitutional part for convenience. Thus, at least two constitutional parts may be combined to form one constitutional part, or one constitutional part may be divided into a plurality of constitutional parts to perform each function. The embodiment where constitutional parts are combined and the embodiment where one constitutional part is divided are also included in the scope of the present invention, if not departing from the essence of the present invention.

The terms used in the present specification are merely used to describe particular embodiments, and are not intended to limit the present invention. An expression used in the singular encompasses the expression of the plural, unless it has a clearly different meaning in the context. In the present specification, it is to be understood that terms such as “including”, “having”, etc. are intended to indicate the existence of the features, numbers, steps, actions, elements, parts, or combinations thereof disclosed in the specification, and are not intended to preclude the possibility that one or more other features, numbers, steps, actions, elements, parts, or combinations thereof may exist or may be added. In other words, when a specific element is referred to as being “included”, elements other than the corresponding element are not excluded, but additional elements may be included in embodiments of the present invention or the scope of the present invention.

In addition, some of the constituents may not be indispensable constituents performing essential functions of the present invention but may be selective constituents improving only performance thereof. The present invention may be implemented by including only the indispensable constitutional parts for implementing the essence of the present invention except the constituents used in improving performance. The structure including only the indispensable constituents except the selective constituents used in improving only performance is also included in the scope of the present invention.

Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings. In describing exemplary embodiments of the present invention, well-known functions or constructions will not be described in detail since they may unnecessarily obscure the understanding of the present invention. The same constituent elements in the drawings are denoted by the same reference numerals, and a repeated description of the same elements will be omitted.

In addition, hereinafter, an image may mean a picture configuring a video, or may mean the video itself. For example, “encoding or decoding or both of an image” may mean “encoding or decoding or both of a video”, and may mean “encoding or decoding or both of one image among images of a video.” Here, a picture and the image may have the same meaning.

Description of Terms

Encoder: means an apparatus performing encoding.

Decoder: means an apparatus performing decoding.

Block: is an M×N array of samples. Herein, M and N mean positive integers, and the block may mean a sample array of a two-dimensional form. The block may refer to a unit. A current block may mean an encoding target block that becomes a target when encoding, or a decoding target block that becomes a target when decoding. In addition, the current block may be at least one of an encoding block, a prediction block, a residual block, and a transform block.

Sample: is a basic unit constituting a block. It may be expressed as a value from 0 to 2^(Bd) − 1 according to a bit depth (Bd). In the present invention, the sample may be used with the same meaning as a pixel.
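As a worked example of this range (illustrative only):

```python
# Sample value range for a bit depth Bd is 0 .. 2**Bd - 1.
for bd in (8, 10, 12):
    print(bd, "bits:", 0, "to", 2**bd - 1)   # max values 255, 1023, 4095
```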

Unit: refers to an encoding and decoding unit. When encoding and decoding an image, the unit may be a region generated by partitioning a single image. In addition, the unit may mean a subdivided unit when a single image is partitioned into subdivided units during encoding or decoding. When encoding and decoding an image, a predetermined process for each unit may be performed. A single unit may be partitioned into sub-units that have sizes smaller than the size of the unit. Depending on functions, the unit may mean a block, a macroblock, a coding tree unit, a coding tree block, a coding unit, a coding block, a prediction unit, a prediction block, a residual unit, a residual block, a transform unit, a transform block, etc. In addition, in order to distinguish a unit from a block, the unit may include a luma component block, a chroma component block associated with the luma component block, and a syntax element of each color component block. The unit may have various sizes and forms, and particularly, the form of the unit may be a two-dimensional geometrical figure such as a rectangular shape, a square shape, a trapezoid shape, a triangular shape, a pentagonal shape, etc. In addition, unit information may include at least one of a unit type indicating the coding unit, the prediction unit, the transform unit, etc., a unit size, a unit depth, a sequence of encoding and decoding of a unit, etc.

Coding Tree Unit: is configured with a single coding tree block of a luma component Y, and two coding tree blocks related to chroma components Cb and Cr. In addition, it may mean including the blocks and a syntax element of each block. Each coding tree unit may be partitioned by using at least one of a quad-tree partitioning method and a binary-tree partitioning method to configure a lower unit such as a coding unit, a prediction unit, a transform unit, etc. It may be used as a term for designating a pixel block that becomes a process unit when encoding/decoding an image as an input image.

Coding Tree Block: may be used as a term for designating any one of a Y coding tree block, a Cb coding tree block, and a Cr coding tree block.

Neighbor Block: means a block adjacent to a current block. The block adjacent to the current block may mean a block that comes into contact with a boundary of the current block, or a block positioned within a predetermined distance from the current block. The neighbor block may mean a block adjacent to a vertex of the current block. Herein, the block adjacent to the vertex of the current block may mean a block vertically adjacent to a neighbor block that is horizontally adjacent to the current block, or a block horizontally adjacent to a neighbor block that is vertically adjacent to the current block.

Reconstructed Neighbor Block: means a neighbor block adjacent to a current block and which has been already spatially/temporally encoded or decoded. Herein, the reconstructed neighbor block may mean a reconstructed neighbor unit. A reconstructed spatial neighbor block may be a block within a current picture and which has been already reconstructed through encoding or decoding or both. A reconstructed temporal neighbor block is a block at the same position as the current block of the current picture within a reference picture, or a neighbor block thereof.

Unit Depth: means a partitioned degree of a unit. In a tree structure, a root node may be the highest node, and a leaf node may be the lowest node. In addition, when a unit is expressed as a tree structure, a level at which the unit is present may mean a unit depth.

Bitstream: means a bitstream including encoded image information.

Parameter Set: corresponds to header information among a configuration within a bitstream. At least one of a video parameter set, a sequence parameter set, a picture parameter set, and an adaptation parameter set may be included in a parameter set. In addition, a parameter set may include a slice header and tile header information.

Parsing: may mean determination of a value of a syntax element by performing entropy decoding, or may mean the entropy decoding itself.

Symbol: may mean at least one of a syntax element, a coding parameter, and a transform coefficient value of an encoding/decoding target unit. In addition, the symbol may mean an entropy encoding target or an entropy decoding result.

Prediction Unit: means a basic unit when performing prediction such as inter-prediction, intra-prediction, inter-compensation, intra-compensation, and motion compensation. A single prediction unit may be partitioned into a plurality of partitions with a small size, or may be partitioned into a lower prediction unit.

Prediction Unit Partition: means a form obtained by partitioning a prediction unit.

Reference Picture List: means a list including one or more reference pictures used for inter-picture prediction or motion compensation. LC (List Combined), L0 (List 0), L1 (List 1), L2 (List 2), L3 (List 3), and the like are types of reference picture lists. One or more reference picture lists may be used for inter-picture prediction.

Inter-picture Prediction Indicator: may mean an inter-picture prediction direction (uni-directional prediction, bi-directional prediction, and the like) of a current block. Alternatively, the inter-picture prediction indicator may mean the number of reference pictures used to generate a prediction block of a current block. Further alternatively, the inter-picture prediction indicator may mean the number of prediction blocks used to perform inter-picture prediction or motion compensation with respect to a current block.

Reference Picture Index: means an index indicating a specific reference picture in a reference picture list.

Reference Picture: may mean a picture to which a specific block refers for inter-picture prediction or motion compensation.

Motion Vector: is a two-dimensional vector used for inter-picture prediction or motion compensation, and may mean an offset between a reference picture and an encoding/decoding target picture. For example, (mvX, mvY) may represent a motion vector, mvX may represent a horizontal component, and mvY may represent a vertical component.
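A minimal illustration of this offset interpretation (integer-pel motion is assumed; the helper name is hypothetical):

```python
def reference_block_origin(block_x, block_y, mv):
    """Locate the reference block by offsetting the current block's
    position with the motion vector (mvX, mvY)."""
    mv_x, mv_y = mv
    return block_x + mv_x, block_y + mv_y

print(reference_block_origin(64, 32, (-3, 5)))  # -> (61, 37)
```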

Motion Vector Candidate: may mean a block that becomes a prediction candidate when predicting a motion vector, or a motion vector of the block. A motion vector candidate may be listed in a motion vector candidate list.

Motion Vector Candidate List: may mean a list of motion vector candidates.

Motion Vector Candidate Index: means an indicator indicating a motion vector candidate in a motion vector candidate list. It is also referred to as an index of a motion vector predictor.

Motion Information: may mean information including a motion vector, a reference picture index, an inter-picture prediction indicator, and at least any one among reference picture list information, a reference picture, a motion vector candidate, a motion vector candidate index, a merge candidate, and a merge index.

Merge Candidate List: means a list composed of merge candidates.

Merge Candidate: means a spatial merge candidate, a temporal merge candidate, a combined merge candidate, a combined bi-prediction merge candidate, a zero merge candidate, or the like. The merge candidate may have an inter-picture prediction indicator, a reference picture index for each list, and motion information such as a motion vector.

Merge Index: means information indicating a merge candidate within a merge candidate list. The merge index may indicate a block used to derive a merge candidate, among reconstructed blocks spatially and/or temporally adjacent to a current block. The merge index may indicate at least one item in the motion information possessed by a merge candidate.

Transform Unit: means a basic unit when performing encoding/decoding such as transform, inverse-transform, quantization, dequantization, and transform coefficient encoding/decoding of a residual signal. A single transform unit may be partitioned into a plurality of transform units having a smaller size.

Scaling: means a process of multiplying a transform coefficient level by a factor. A transform coefficient may be generated by scaling a transform coefficient level. The scaling also may be referred to as dequantization.

Quantization Parameter: may mean a value used when generating a transform coefficient level of a transform coefficient during quantization. The quantization parameter also may mean a value used when generating a transform coefficient by scaling a transform coefficient level during dequantization. The quantization parameter may be a value mapped on a quantization step size.
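For concreteness, a hedged sketch of the mapping between a quantization parameter and a step size follows; the HEVC-style relation Qstep ≈ 2^((QP−4)/6) is assumed here purely for illustration and is not asserted to be the relation used by the present invention.

```python
def qstep(qp):
    """Assumed HEVC-style mapping: the step size doubles every 6 QP steps."""
    return 2.0 ** ((qp - 4) / 6.0)

def dequantize(level, qp):
    """'Scaling': multiply a transform coefficient level by the step size."""
    return level * qstep(qp)

print(round(qstep(22), 3), round(qstep(28), 3))  # ~8.0 -> ~16.0 (doubled)
print(dequantize(3, 28))                         # level 3 scaled back up
```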

Delta Quantization Parameter: means a difference value between a predicted quantization parameter and a quantization parameter of an encoding/decoding target unit.

Scan: means a method of sequencing coefficients within a block or a matrix. For example, changing a two-dimensional matrix of coefficients into a one-dimensional matrix may be referred to as scanning, and changing a one-dimensional matrix of coefficients into a two-dimensional matrix may be referred to as scanning or inverse scanning.
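A minimal sketch of such a scan (the diagonal order below is one common choice, assumed here for illustration):

```python
def diagonal_scan_order(n):
    """Visit an n x n block one anti-diagonal at a time."""
    order = [(x, y) for y in range(n) for x in range(n)]
    order.sort(key=lambda p: (p[0] + p[1], p[1]))
    return order

block = [[9, 5, 1, 0],
         [4, 2, 0, 0],
         [1, 0, 0, 0],
         [0, 0, 0, 0]]
coeffs = [block[y][x] for (x, y) in diagonal_scan_order(4)]
print(coeffs)  # low-frequency coefficients first, trailing zeros last
```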

Transform Coefficient: may mean a coefficient value generated after transform is performed in an encoder. It may mean a coefficient value generated after at least one of entropy decoding and dequantization is performed in a decoder. A quantized level obtained by quantizing a transform coefficient or a residual signal, or a quantized transform coefficient level, also may fall within the meaning of the transform coefficient.

Quantized Level: means a value generated by quantizing a transform coefficient or a residual signal in an encoder. Alternatively, the quantized level may mean a value that is a dequantization target to undergo dequantization in a decoder. Similarly, a quantized transform coefficient level that is a result of transform and quantization also may fall within the meaning of the quantized level.

Non-zero Transform Coefficient: means a transform coefficient having a value other than zero, or a transform coefficient level having a value other than zero.

Quantization Matrix: means a matrix used in a quantization process or a dequantization process performed to improve subjective or objective image quality. The quantization matrix also may be referred to as a scaling list.

Quantization Matrix Coefficient: means each element within a quantization matrix. The quantization matrix coefficient also may be referred to as a matrix coefficient.

Default Matrix: means a predetermined quantization matrix preliminarily defined in an encoder or a decoder.

Non-default Matrix: means a quantization matrix that is not preliminarily defined in an encoder or a decoder but is signaled by a user.

FIG. 1 is a block diagram showing a configuration of an encoding apparatus according to an embodiment to which the present invention is applied.

An encoding apparatus 100 may be an encoder, a video encoding apparatus, or an image encoding apparatus. A video may include at least one image. The encoding apparatus 100 may sequentially encode at least one image.

Referring to FIG. 1, the encoding apparatus 100 may include a motion prediction unit 111, a motion compensation unit 112, an intra-prediction unit 120, a switch 115, a subtractor 125, a transform unit 130, a quantization unit 140, an entropy encoding unit 150, a dequantization unit 160, an inverse-transform unit 170, an adder 175, a filter unit 180, and a reference picture buffer 190.

The encoding apparatus 100 may perform encoding of an input image by using an intra mode or an inter mode or both. In addition, the encoding apparatus 100 may generate a bitstream by encoding the input image, and output the generated bitstream. The generated bitstream may be stored in a computer readable recording medium, or may be streamed through a wired/wireless transmission medium. When an intra mode is used as a prediction mode, the switch 115 may be switched to an intra mode. Alternatively, when an inter mode is used as a prediction mode, the switch 115 may be switched to an inter mode. Herein, the intra mode may mean an intra-prediction mode, and the inter mode may mean an inter-prediction mode. The encoding apparatus 100 may generate a prediction block for an input block of the input image. In addition, the encoding apparatus 100 may encode a residual of the input block and the prediction block after the prediction block is generated. The input image may be called a current image that is a current encoding target. The input block may be called a current block that is a current encoding target, or an encoding target block.

When a prediction mode is an intra mode, the intra-prediction unit 120 may use a pixel value of a block that has been already encoded/decoded and is adjacent to a current block as a reference pixel. The intra-prediction unit 120 may perform spatial prediction by using the reference pixel, or generate prediction samples of an input block by performing spatial prediction. Herein, intra prediction may mean intra-picture prediction.

When a prediction mode is an inter mode, the motion prediction unit 111 may retrieve a region that best matches with an input block from a reference image when performing motion prediction, and deduce a motion vector by using the retrieved region. The reference image may be stored in the reference picture buffer 190.

The motion compensation unit 112 may generate a prediction block by performing motion compensation using a motion vector. Herein, inter prediction may mean inter-picture prediction or motion compensation.

When the value of the motion vector is not an integer, the motion prediction unit 111 and the motion compensation unit 112 may generate the prediction block by applying an interpolation filter to a partial region of the reference picture. In order to perform inter-picture prediction or motion compensation on a coding unit, it may be determined which mode among a skip mode, a merge mode, an advanced motion vector prediction (AMVP) mode, and a current picture referring mode is used for motion prediction and motion compensation of a prediction unit included in the corresponding coding unit. Then, inter-picture prediction or motion compensation may be performed differently depending on the determined mode.
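As a hedged sketch of such fractional-position interpolation: real codecs use longer interpolation filters, but plain bilinear interpolation, assumed below purely for illustration, shows the idea.

```python
def bilinear_sample(picture, x, y):
    """Interpolate a sample at a fractional (x, y) position from the four
    surrounding integer-position pixels (bilinear weights)."""
    x0, y0 = int(x), int(y)
    fx, fy = x - x0, y - y0
    p00, p01 = picture[y0][x0],     picture[y0][x0 + 1]
    p10, p11 = picture[y0 + 1][x0], picture[y0 + 1][x0 + 1]
    top = p00 * (1 - fx) + p01 * fx
    bot = p10 * (1 - fx) + p11 * fx
    return top * (1 - fy) + bot * fy

ref = [[10, 20], [30, 40]]
print(bilinear_sample(ref, 0.5, 0.5))  # 25.0 at a half-pel position
```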

The subtractor 125 may generate a residual block by using a residual between an input block and a prediction block. The residual block may be called a residual signal. The residual signal may mean a difference between an original signal and a prediction signal. In addition, the residual signal may be a signal generated by transforming or quantizing, or transforming and quantizing, a difference between the original signal and the prediction signal. The residual block may be a residual signal of a block unit.

The transform unit 130 may generate a transform coefficient by performing a transform of a residual block, and output the generated transform coefficient. Herein, the transform coefficient may be a coefficient value generated by performing a transform of the residual block. When a transform skip mode is applied, the transform unit 130 may skip the transform of the residual block.

A quantized level may be generated by applying quantization to the transform coefficient or to the residual signal. Hereinafter, the quantized level also may be called a transform coefficient in embodiments.

The quantization unit 140 may generate a quantized level by quantizing the transform coefficient or the residual signal according to a parameter, and output the generated quantized level. Herein, the quantization unit 140 may quantize the transform coefficient by using a quantization matrix.

The entropy encoding unit 150 may generate a bitstream by performing entropy encoding according to a probability distribution on values calculated by the quantization unit 140 or on coding parameter values calculated when performing encoding, and output the generated bitstream. The entropy encoding unit 150 may perform entropy encoding of pixel information of an image and information for decoding an image. For example, the information for decoding the image may include a syntax element.

When entropy encoding is applied, symbols are represented so that a smaller number of bits are assigned to a symbol having a high chance of being generated and a larger number of bits are assigned to a symbol having a low chance of being generated, and thus, the size of the bitstream for symbols to be encoded may be decreased. The entropy encoding unit 150 may use an encoding method for entropy encoding such as exponential Golomb, context-adaptive variable length coding (CAVLC), context-adaptive binary arithmetic coding (CABAC), etc. For example, the entropy encoding unit 150 may perform entropy encoding by using a variable length coding/code (VLC) table. In addition, the entropy encoding unit 150 may deduce a binarization method of a target symbol and a probability model of a target symbol/bin, and perform arithmetic coding by using the deduced binarization method and context model.
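As an illustration of one of the named methods, a minimal zeroth-order exponential-Golomb coder follows (the string-based bit handling is a simplification for clarity):

```python
def exp_golomb_encode(k):
    """code_num k is written as (leading zeros) + binary(k + 1);
    small (frequent) values get short codes."""
    bits = bin(k + 1)[2:]
    return "0" * (len(bits) - 1) + bits

def exp_golomb_decode(code):
    zeros = len(code) - len(code.lstrip("0"))
    return int(code[zeros:], 2) - 1

for k in range(5):
    print(k, exp_golomb_encode(k))   # 1, 010, 011, 00100, 00101
assert exp_golomb_decode("00100") == 3
```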

In order to encode a transform coefficient level, the entropy encoding unit 150 may change a two-dimensional block form coefficient into a one-dimensional vector form by using a transform coefficient scanning method.

A coding parameter may include information (flag, index, etc.), such as a syntax element, that is encoded in an encoder and signaled to a decoder, and information derived when performing encoding or decoding. The coding parameter may mean information required when encoding or decoding an image. For example, at least one value or a combination form of a unit/block size, a unit/block depth, unit/block partition information, a unit/block partition structure, whether to partition in a quad-tree form, whether to partition in a binary-tree form, a partition direction of a binary-tree form (horizontal direction or vertical direction), a partition form of a binary-tree form (symmetric partition or asymmetric partition), an intra-prediction mode/direction, a reference sample filtering method, a prediction block filtering method, a prediction block filter tap, a prediction block filter coefficient, an inter-prediction mode, motion information, a motion vector, a reference picture index, an inter-prediction angle, an inter-prediction indicator, a reference picture list, a reference picture, a motion vector predictor candidate, a motion vector candidate list, whether to use a merge mode, a merge candidate, a merge candidate list, whether to use a skip mode, an interpolation filter type, an interpolation filter tap, an interpolation filter coefficient, a motion vector size, a representation accuracy of a motion vector, a transform type, a transform size, information of whether or not a primary (first) transform is used, information of whether or not a secondary transform is used, a primary transform index, a secondary transform index, information of whether or not a residual signal is present, a coded block pattern, a coded block flag (CBF), a quantization parameter, a quantization matrix, whether to apply an intra loop filter, an intra loop filter coefficient, an intra loop filter tap, an intra loop filter shape/form, whether to apply a deblocking filter, a deblocking filter coefficient, a deblocking filter tap, a deblocking filter strength, a deblocking filter shape/form, whether to apply an adaptive sample offset, an adaptive sample offset value, an adaptive sample offset category, an adaptive sample offset type, whether to apply an adaptive in-loop filter, an adaptive in-loop filter coefficient, an adaptive in-loop filter tap, an adaptive in-loop filter shape/form, a binarization/inverse-binarization method, a context model determining method, a context model updating method, whether to perform a regular mode, whether to perform a bypass mode, a context bin, a bypass bin, a transform coefficient, a transform coefficient level, a transform coefficient level scanning method, an image displaying/outputting sequence, slice identification information, a slice type, slice partition information, tile identification information, a tile type, tile partition information, a picture type, a bit depth, and information of a luma signal or a chroma signal may be included in the coding parameter.

Herein, signaling the flag or index may mean that a corresponding flag or index is entropy encoded and included in a bitstream by an encoder, and may mean that the corresponding flag or index is entropy decoded from a bitstream by a decoder.

When the encoding apparatus 100 performs encoding through inter-prediction, an encoded current image may be used as a reference image for another image that is processed afterwards. Accordingly, the encoding apparatus 100 may reconstruct or decode the encoded current image, and store the reconstructed or decoded image as a reference image.

A quantized level may be dequantized in the dequantization unit 160, or may be inverse-transformed in the inverse-transform unit 170. A dequantized or inverse-transformed coefficient or both may be added to a prediction block by the adder 175. By adding the dequantized or inverse-transformed coefficient or both to the prediction block, a reconstructed block may be generated. Herein, the dequantized or inverse-transformed coefficient or both may mean a coefficient on which at least one of dequantization and inverse-transform is performed, and may mean a reconstructed residual block.

A reconstructed block may pass through the filter unit 180. The filter unit 180 may apply at least one of a deblocking filter, a sample adaptive offset (SAO), and an adaptive loop filter (ALF) to the reconstructed block or a reconstructed image. The filter unit 180 may be called an in-loop filter.

The deblocking filter may remove block distortion generated at boundaries between blocks. In order to determine whether or not to apply a deblocking filter, whether or not to apply a deblocking filter to a current block may be determined based on pixels included in several rows or columns which are included in the block. When a deblocking filter is applied to a block, another filter may be applied according to a required deblocking filtering strength.

In order to compensate for an encoding error, a proper offset value may be added to a pixel value by using a sample adaptive offset. The sample adaptive offset may correct an offset of a deblocked image from an original image by a pixel unit. A method of partitioning pixels of an image into a predetermined number of regions, determining a region to which an offset is applied, and applying the offset to the determined region, or a method of applying an offset in consideration of edge information on each pixel may be used.

The adaptive loop filter may perform filtering based on a comparison result of the filtered reconstructed image and the original image. Pixels included in an image may be partitioned into predetermined groups, a filter to be applied to each group may be determined, and differential filtering may be performed for each group. Information of whether or not to apply the ALF may be signaled by coding units (CUs), and a form and coefficient of the ALF to be applied to each block may vary.

The reconstructed block or the reconstructed image having passed through the filter unit 180 may be stored in the reference picture buffer 190.

FIG. 2 is a block diagram showing a configuration of a decoding apparatus according to an embodiment to which the present invention is applied.

A decoding apparatus 200 may be a decoder, a video decoding apparatus, or an image decoding apparatus.

Referring to FIG. 2, the decoding apparatus 200 may include an entropy decoding unit 210, a dequantization unit 220, an inverse-transform unit 230, an intra-prediction unit 240, a motion compensation unit 250, an adder 225, a filter unit 260, and a reference picture buffer 270.

The decoding apparatus 200 may receive a bitstream output from the encoding apparatus 100. The decoding apparatus 200 may receive a bitstream stored in a computer readable recording medium, or may receive a bitstream that is streamed through a wired/wireless transmission medium. The decoding apparatus 200 may decode the bitstream by using an intra mode or an inter mode. In addition, the decoding apparatus 200 may generate a reconstructed image generated through decoding or a decoded image, and output the reconstructed image or decoded image.

When a prediction mode used when decoding is an intra mode, a switch may be switched to an intra mode. Alternatively, when a prediction mode used when decoding is an inter mode, a switch may be switched to an inter mode.

The decoding apparatus 200 may obtain a reconstructed residual block by decoding the input bitstream, and generate a prediction block. When the reconstructed residual block and the prediction block are obtained, the decoding apparatus 200 may generate a reconstructed block that becomes a decoding target by adding the reconstructed residual block to the prediction block. The decoding target block may be called a current block.

The entropy decoding unit 210 may generate symbols by entropy decoding the bitstream according to a probability distribution. The generated symbols may include a symbol of a quantized level form. Herein, an entropy decoding method may be an inverse process of the entropy encoding method described above.

In order to decode a transform coefficient level, the entropy decoding unit 210 may change a one-dimensional vector form coefficient into a two-dimensional block form by using a transform coefficient scanning method.

A quantized level may be dequantized in the dequantization unit 220, or inverse-transformed in the inverse-transform unit 230. The quantized level may be a result of dequantizing or inverse-transforming or both, and may be generated as a reconstructed residual block. Herein, the dequantization unit 220 may apply a quantization matrix to the quantized level.

When an intra mode is used, the intra-prediction unit 240 may generate a prediction block by performing spatial prediction that uses a pixel value of a block adjacent to a decoding target block and which has been already decoded.

When an inter mode is used, the motion compensation unit 250 may generate a prediction block by performing motion compensation that uses a motion vector and a reference image stored in the reference picture buffer 270. When the value of the motion vector is not an integer, the motion compensation unit 250 may generate the prediction block by applying the interpolation filter to a partial region of a reference picture. In order to perform motion compensation on a coding unit, it may first be determined which mode among a skip mode, a merge mode, an AMVP mode, and a current picture reference mode is to be used for motion compensation of a prediction unit included in the corresponding coding unit, and the motion compensation may then be performed according to the determined mode.

The adder 225 may generate a reconstructed block by adding the reconstructed residual block to the prediction block. The filter unit 260 may apply at least one of a deblocking filter, a sample adaptive offset, and an adaptive loop filter to the reconstructed block or reconstructed image. The filter unit 260 may output the reconstructed image. The reconstructed block or reconstructed image may be stored in the reference picture buffer 270 and used when performing inter-prediction.

FIG. 3 is a view schematically showing a partition structure of an image when encoding and decoding the image. FIG. 3 schematically shows an example of partitioning a single unit into a plurality of lower units.

In order to efficiently partition an image when encoding and decoding, a coding unit (CU) may be used. The coding unit may be used as a basic unit when encoding/decoding the image. In addition, the coding unit may be used as a unit for distinguishing an intra mode and an inter mode when encoding/decoding the image. The coding unit may be a basic unit used for prediction, transform, quantization, inverse-transform, dequantization, or an encoding/decoding process of a transform coefficient.

Referring to FIG. 3, an image 300 is sequentially partitioned in units of a largest coding unit (LCU), and a partition structure is determined for each LCU. Herein, the LCU may be used with the same meaning as a coding tree unit (CTU). Unit partitioning may mean partitioning a block associated with the unit. Block partition information may include information of a unit depth. Depth information may represent a number of times or a degree or both by which a unit is partitioned. A single unit may be hierarchically partitioned with depth information based on a tree structure. Each of the partitioned lower units may have depth information. Depth information may be information representing a size of a CU, and may be stored in each CU.

A partition structure may mean a distribution of coding units (CUs) within an LCU 310. Such a distribution may be determined according to whether or not to partition a single CU into a plurality (positive integer equal to or greater than 2, including 2, 4, 8, 16, etc.) of CUs. A horizontal size and a vertical size of the CU generated by partitioning may respectively be half of a horizontal size and a vertical size of the CU before partitioning, or may respectively have sizes smaller than a horizontal size and a vertical size before partitioning according to the number of times of partitioning. The CU may be recursively partitioned into a plurality of CUs. Partitioning of the CU may be recursively performed until reaching a predefined depth or a predefined size. For example, a depth of an LCU may be 0, and a depth of a smallest coding unit (SCU) may be a predefined maximum depth. Herein, the LCU may be a coding unit having a maximum coding unit size, and the SCU may be a coding unit having a minimum coding unit size, as described above. Partitioning starts from the LCU 310, and a CU depth increases by 1 whenever a horizontal size or a vertical size or both of the CU decrease by partitioning.

In addition, information on whether or not the CU is partitioned may be represented by using partition information of the CU. The partition information may be 1-bit information. All CUs, except for an SCU, may include partition information. For example, when a value of the partition information is a first value, the CU may not be partitioned; when a value of the partition information is a second value, the CU may be partitioned.

Referring to FIG. 3, an LCU having a depth of 0 may be a 64×64 block. 0 may be a minimum depth. An SCU having a depth of 3 may be an 8×8 block. 3 may be a maximum depth. A CU of a 32×32 block and a CU of a 16×16 block may be represented as a depth of 1 and a depth of 2, respectively.

For example, when a single coding unit is partitioned into four coding units, a horizontal size and a vertical size of each of the four partitioned coding units may be half the horizontal size and half the vertical size of the CU before being partitioned. In one embodiment, when a coding unit having a 32×32 size is partitioned into four coding units, each of the four partitioned coding units may have a 16×16 size. When a single coding unit is partitioned into four coding units, it may be said that the coding unit is partitioned in a quad-tree form.

For example, when a single coding unit is partitioned into two coding units, a horizontal or vertical size of each of the two coding units may be half of the horizontal or vertical size of the coding unit before being partitioned. For example, when a coding unit having a 32×32 size is partitioned in a vertical direction, each of the two partitioned coding units may have a size of 16×32. When a single coding unit is partitioned into two coding units, it may be said that the coding unit is partitioned in a binary-tree form. An LCU 320 of FIG. 3 is an example of an LCU to which both partitioning in a quad-tree form and partitioning in a binary-tree form are applied.
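A hedged sketch of the quad-tree partitioning just described follows; the split decision below is a placeholder (split while larger than a minimum size), whereas a real codec would read the 1-bit partition information from the bitstream.

```python
def quad_tree_split(x, y, size, min_size, leaves):
    split_flag = size > min_size        # placeholder for a parsed 1-bit flag
    if not split_flag:
        leaves.append((x, y, size))     # this CU is not partitioned further
        return
    half = size // 2                    # each sub-CU has half width/height
    for dy in (0, half):
        for dx in (0, half):
            quad_tree_split(x + dx, y + dy, half, min_size, leaves)

leaves = []
quad_tree_split(0, 0, 64, 32, leaves)   # a 64x64 LCU split into 32x32 CUs
print(len(leaves), leaves[0])           # 4 leaf CUs, first at (0, 0, 32)
```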

FIG. 4 is a view showing an intra-prediction process.

An intra-prediction mode may be a non-angular mode or an angular mode. The non-angular mode may be a DC mode or a planar mode, and the angular mode may be a prediction mode having a specific direction or angle. The intra-prediction mode may be expressed by at least one of a mode number, a mode value, a mode numeral, and a mode angle. The number of intra-prediction modes may be M, which is larger than 1, including the non-angular and the angular modes.

The number of intra-prediction modes may be fixed to N regardless of a block size. Alternatively, the number of intra-prediction modes may vary according to a block size or a color component type or both. For example, as a block size becomes larger, the number of intra-prediction modes may increase. Alternatively, the number of intra-prediction modes of a luma component block may be larger than the number of intra-prediction modes of a chroma component block.

In order to intra-predict a current block, a step of determining whether or not samples included in a reconstructed neighbor block may be used as reference samples of the current block may be performed. When a sample that is not usable as a reference sample of the current block is present, a value obtained by duplicating or interpolating, or both, at least one sample value among the samples included in the reconstructed neighbor block may be used to replace the non-usable sample value, and the replaced sample value is then used as a reference sample of the current block.
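A minimal sketch of such a substitution is given below; the scan direction, the None-based availability marking, and the default value are assumptions of this sketch.

```python
def substitute_reference_samples(samples, default=128):
    """Fill unavailable reference samples (None) by duplicating the
    nearest available sample seen so far; leading gaps copy back the
    first available sample, and a default is used if none exist."""
    filled, last = [], None
    for s in samples:
        last = s if s is not None else last
        filled.append(last)
    first = next((s for s in filled if s is not None), default)
    return [first if s is None else s for s in filled]

print(substitute_reference_samples([None, None, 100, None, 104]))
# -> [100, 100, 100, 100, 104]
```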

When intra-predicting, a filter may be applied to at least one of areference sample and a prediction sample based on an intra-predictionmode and a current block size.

In case of a planar mode, when generating a prediction block of acurrent block, according to a position of a prediction target samplewithin a prediction block, a sample value of the prediction targetsample may be generated by using a weighted sum of an upper and leftside reference sample of a current sample, and a right upper side andleft lower side reference sample of the current block. In addition, incase of a DC mode, when generating a prediction block of a currentblock, an average value of upper side and left side reference samples ofthe current block may be used. In addition, in case of an angular mode,a prediction block may be generated by using an upper side, a left side,a right upper side, and/or a left lower side reference sample of thecurrent block. In order to generate a prediction sample value,interpolation of a real number unit may be performed.
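The DC and planar rules above can be sketched as follows; this is a simplified floating-point illustration in which the planar form follows the familiar HEVC-style bilinear interpolation (an assumption here, since the text only specifies a position-dependent weighted sum), and integer rounding is omitted.

import numpy as np

# DC mode: every prediction sample is the average of the upper and left
# reference samples of the current block.
def dc_predict(top: np.ndarray, left: np.ndarray) -> np.ndarray:
    dc = np.concatenate([top, left]).mean()
    return np.full((len(left), len(top)), dc)

# Planar mode: each sample is a position-dependent weighted sum of the
# left/upper references and the right-upper/left-lower corner references.
def planar_predict(top, left, top_right, bottom_left):
    n = len(top)
    pred = np.zeros((n, n))
    for y in range(n):
        for x in range(n):
            horizontal = (n - 1 - x) * left[y] + (x + 1) * top_right
            vertical = (n - 1 - y) * top[x] + (y + 1) * bottom_left
            pred[y, x] = (horizontal + vertical) / (2.0 * n)
    return pred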

An intra-prediction mode of a current block may be entropy encoded/decoded by predicting it from an intra-prediction mode of a block adjacent to the current block. When the intra-prediction modes of the current block and the neighbor block are identical, the fact that the intra-prediction modes of the current block and the neighbor block are identical may be signaled by using predetermined flag information. In addition, indicator information of an intra-prediction mode that is identical to the intra-prediction mode of the current block, among the intra-prediction modes of a plurality of neighbor blocks, may be signaled. When the intra-prediction modes of the current block and the neighbor block are different, intra-prediction mode information of the current block may be entropy encoded/decoded based on the intra-prediction mode of the neighbor block.

FIG. 5 is a diagram illustrating an embodiment of an inter-picture prediction process.

In FIG. 5, a rectangle may represent a picture, and an arrow represents a prediction direction. Pictures may be categorized into intra pictures (I pictures), predictive pictures (P pictures), and bi-predictive pictures (B pictures) according to the encoding type thereof.

The I picture may be encoded through intra-prediction without requiring inter-picture prediction. The P picture may be encoded through inter-picture prediction by using a reference picture that is present in one direction (i.e., a forward direction or a backward direction) with respect to a current block. The B picture may be encoded through inter-picture prediction by using reference pictures that are present in two directions (i.e., a forward direction and a backward direction) with respect to a current block. When inter-picture prediction is used, the encoder may perform inter-picture prediction or motion compensation, and the decoder may perform the corresponding motion compensation.

Hereinbelow, an embodiment of the inter-picture prediction will be described in detail.

The inter-picture prediction or motion compensation may be performed using a reference picture and motion information.

Motion information of a current block may be derived during inter-picture prediction by each of the encoding apparatus 100 and the decoding apparatus 200. The motion information of the current block may be derived by using motion information of a reconstructed neighboring block, motion information of a collocated block (also referred to as a col block or a co-located block), and/or a block adjacent to the co-located block. The co-located block may mean a block that is located spatially at the same position as the current block, within a previously reconstructed collocated picture (also referred to as a col picture or a co-located picture). The co-located picture may be one picture among one or more reference pictures included in a reference picture list.

A method of deriving the motion information of the current block may vary depending on a prediction mode of the current block. For example, as prediction modes for inter-picture prediction, there may be an AMVP mode, a merge mode, a skip mode, a current picture reference mode, etc. The merge mode may be referred to as a motion merge mode.

For example, when the AMVP is used as the prediction mode, at least one of the motion vectors of the reconstructed neighboring blocks, the motion vectors of the co-located blocks, the motion vectors of blocks adjacent to the co-located blocks, and a (0, 0) motion vector may be determined as motion vector candidates for the current block, and a motion vector candidate list is generated by using the motion vector candidates. The motion vector candidate of the current block can be derived by using the generated motion vector candidate list. The motion information of the current block may be determined based on the derived motion vector candidate. The motion vectors of the collocated blocks or the motion vectors of the blocks adjacent to the collocated blocks may be referred to as temporal motion vector candidates, and the motion vectors of the reconstructed neighboring blocks may be referred to as spatial motion vector candidates.

The encoding apparatus 100 may calculate a motion vector difference (MVD) between the motion vector of the current block and the motion vector candidate and may perform entropy encoding on the motion vector difference (MVD). In addition, the encoding apparatus 100 may perform entropy encoding on a motion vector candidate index and generate a bitstream. The motion vector candidate index may indicate an optimum motion vector candidate among the motion vector candidates included in the motion vector candidate list. The decoding apparatus may perform entropy decoding on the motion vector candidate index included in the bitstream and may select a motion vector candidate of a decoding target block from among the motion vector candidates included in the motion vector candidate list by using the entropy-decoded motion vector candidate index. In addition, the decoding apparatus 200 may add the entropy-decoded MVD and the motion vector candidate extracted through the entropy decoding, thereby deriving the motion vector of the decoding target block.
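As a decoder-side illustration of the AMVP reconstruction just described, the sketch below selects a predictor with the decoded candidate index and adds the decoded MVD; all names are hypothetical.

from typing import List, Tuple

MotionVector = Tuple[int, int]

# AMVP decoding sketch: pick the motion vector predictor indicated by the
# entropy-decoded candidate index, then add the entropy-decoded MVD.
def decode_amvp_mv(candidates: List[MotionVector],
                   candidate_idx: int,
                   mvd: MotionVector) -> MotionVector:
    mvp = candidates[candidate_idx]
    return (mvp[0] + mvd[0], mvp[1] + mvd[1])

# Example: spatial/temporal candidates plus the (0, 0) vector.
candidates = [(4, -2), (3, 0), (0, 0)]
assert decode_amvp_mv(candidates, 1, (1, -1)) == (4, -1)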

The bitstream may include a reference picture index indicating a reference picture. The reference picture index may be entropy-encoded by the encoding apparatus 100 and then signaled as a bitstream to the decoding apparatus 200. The decoding apparatus 200 may generate a prediction block of the decoding target block based on the derived motion vector and the reference picture index information.

Another example of the method of deriving the motion information of the current block may be the merge mode. The merge mode may mean a method of merging motion of a plurality of blocks. The merge mode may mean a mode of deriving the motion information of the current block from the motion information of the neighboring blocks. When the merge mode is applied, the merge candidate list may be generated using the motion information of the reconstructed neighboring blocks and/or the motion information of the collocated blocks. The motion information may include at least one of a motion vector, a reference picture index, and an inter-picture prediction indicator. The inter-picture prediction indicator may indicate one-direction prediction (L0 prediction or L1 prediction) or two-direction prediction (L0 prediction and L1 prediction).

The merge candidate list may be a list in which motion information is stored. The motion information included in the merge candidate list may be at least one of a zero merge candidate and new motion information that is a combination of the motion information (a spatial merge candidate) of a neighboring block adjacent to the current block, the motion information (a temporal merge candidate) of the collocated block of the current block included within a reference picture, and the motion information already existing in the merge candidate list.

The encoding apparatus 100 may generate a bitstream by performing entropy encoding on at least one of a merge flag and a merge index and may signal the bitstream to the decoding apparatus 200. The merge flag may be information indicating whether or not to perform the merge mode for each block, and the merge index may be information indicating which neighboring block, among the neighboring blocks of the current block, is a merge target block. For example, the neighboring blocks of the current block may include a left neighboring block on the left side of the current block, an upper neighboring block disposed above the current block, and a temporal neighboring block temporally adjacent to the current block.

The skip mode may be a mode in which the motion information of a neighboring block is applied to the current block as it is. When the skip mode is applied, the encoding apparatus 100 may perform entropy encoding on information indicating which block's motion information is to be used as the motion information of the current block to generate a bitstream, and may signal the bitstream to the decoding apparatus 200. The encoding apparatus 100 may not signal a syntax element regarding at least any one of the motion vector difference information, the encoding block flag, and the transform coefficient level to the decoding apparatus 200.

The current picture reference mode may mean a prediction mode in which a previously reconstructed region within a current picture to which the current block belongs is used for prediction. Here, a vector may be used to specify the previously reconstructed region. Information indicating whether the current block is to be encoded in the current picture reference mode may be encoded by using the reference picture index of the current block. The flag or index indicating whether or not the current block is a block encoded in the current picture reference mode may be signaled, or may be deduced based on the reference picture index of the current block. In the case where the current block is encoded in the current picture reference mode, the current picture may be added to the reference picture list for the current block so as to be located at a fixed position or an arbitrary position in the reference picture list. The fixed position may be, for example, a position indicated by a reference picture index of 0, or the last position in the list. When the current picture is added to the reference picture list so as to be located at an arbitrary position, a reference picture index indicating that position may be signaled.
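The list management for the current picture reference mode can be pictured with a small sketch; inserting at index 0 models the fixed position, an out-of-range index models the "last position" case, and the function and picture objects are hypothetical.

# Sketch: add the current picture to the reference picture list, either at a
# fixed position (e.g., index 0 or the end of the list) or at a signaled one.
def add_current_picture(ref_list: list, current_picture, position: int = 0) -> list:
    new_list = list(ref_list)
    if position >= len(new_list):
        new_list.append(current_picture)  # fixed "last position" case
    else:
        new_list.insert(position, current_picture)
    return new_list

# Example: placing the current picture at reference picture index 0.
refs = ["ref0", "ref1"]
assert add_current_picture(refs, "cur", 0) == ["cur", "ref0", "ref1"]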

A decoder may perform intra prediction on the current block, which is a decoding target block, by referring to a pixel of an area already reconstructed before the current block (hereinafter referred to as a "pre-reconstructed area"). The pre-reconstructed area may be, for example, at least one of the left, upper, upper left, and upper right neighbor areas of the current block. Efficiency in intra prediction is high when similarity between the pre-reconstructed area and the current block is high. However, in prediction of the current block, even though similarity between an area not reconstructed yet (hereinafter referred to as an "unreconstructed area") and the current block is high, the pixel of the unreconstructed area may not be referenced, and thus enhancement of intra-prediction efficiency is constrained. The unreconstructed area may be, for example, at least one of the right, lower, lower left, and lower right neighbor areas of the current block.

As described above, when a pixel that may be referenced does not exist in the unreconstructed area, intra prediction where a pixel of the unreconstructed area is referenced is generally impossible. In the present invention, a pixel of the unreconstructed area is generated to solve this problem and is used in performing intra prediction. According to the present invention, the pixel of the unreconstructed area may be generated using a pixel of a reference picture. That is, since a pixel of another picture (a reference picture) is referenced, intra prediction according to the present invention may not have the precise meaning of intra prediction. However, in intra prediction according to the present invention, the operation other than the configuration of a reference sample by using a reference picture is the same as or similar to the general operation of intra prediction.

An intra-prediction mode may include a non-directionality mode, such as a DC mode and a PLANAR mode, and a directionality mode (an angular mode). In the description, intra prediction using the pixel of the current picture is defined as Intra_DC prediction, Intra_PLANAR prediction, Intra_Angular prediction, etc., and intra prediction using the pixel of the reference picture is defined as Inter_DC prediction, Inter_PLANAR prediction, Inter_Angular prediction, etc.

Accordingly, in the description, "intra prediction" is different from intra prediction in the general sense. General intra prediction and intra prediction according to the present invention may be distinguished depending on whether the pixel of the current picture or the pixel of the reference picture is used in configuring a reference sample. Also, in intra prediction of the present invention, when configuring a reference sample, at least one of the pixel of the current picture, the pixel of the reference picture, and the pixel of the geometric-transform reference picture may be used.

FIG. 6 is a view illustrating the case where similarity between the current block and the unreconstructed area is high.

In the example shown in FIG. 6, similarity between the current block and the pre-reconstructed area is on average low, and similarity between the current block and the unreconstructed area is on average high. Therefore, intra prediction where the pixel of the unreconstructed area is referenced is more advantageous, in terms of encoding efficiency, than intra prediction where the pixel of the pre-reconstructed area is referenced. The present invention proposes, for example, a method of referring to the pixel of the unreconstructed area in the example shown in FIG. 6.

FIG. 7 is a view illustrating an intra-prediction mode according to the present invention.

In the intra-prediction mode where only the pixel of the pre-reconstructed area in the current picture is referenced, as described with reference to FIG. 4, there is only a prediction direction indicating the pixel of the pre-reconstructed area in the current picture. However, in intra prediction according to the present invention, the pixel of the pre-reconstructed area as well as the pixel at a position corresponding to the unreconstructed area may be referenced. Accordingly, as shown in FIG. 7, the intra-prediction mode may include a prediction mode in all directions (omni-directional).

When the intra-prediction mode of the current block indicates a position in the unreconstructed area, the unreconstructed area in the current picture may not be referenced. Therefore, the unreconstructed area may be configured using a pixel of another picture that is already reconstructed. For example, when the current picture is an inter picture, the unreconstructed area may be configured using a pixel of a reference picture. A method of configuring the unreconstructed area by using a pixel of an already reconstructed picture or a pixel of a reference picture will be described in detail later.

FIG. 8 is a view illustrating directionality of an intra-prediction mode for an area having pixel distribution with directionality.

FIG. 8(a) illustrates an example of partitioning an image area having pixel distribution with particular directionality into units of a block. In FIG. 8(a), adjacent blocks may have the same or similar directionality. FIG. 8(b) illustrates an example of directionality of the intra-prediction mode of each block in the case where only the pixel of the pre-reconstructed area is referenced. FIG. 8(c) illustrates an example of directionality of the intra-prediction mode of each block in the case where the pixel of the pre-reconstructed area and the pixel of the unreconstructed area are referenced.

In FIGS. 8(b) and 8(c), a prediction direction referring to the pixel of the pre-reconstructed area is designated as a solid line arrow, and a prediction direction referring to the pixel of the unreconstructed area is designated as a broken line arrow. The prediction directions of a solid line arrow and its corresponding broken line arrow are equal to each other, and the solid line arrow and the broken line arrow are symmetrical with respect to the center of the block.

As shown in FIG. 7, the solid line arrow and the broken line arrow in FIG. 8(c) may be indicated by different types of mode information (e.g., mode numbers). In contrast, in FIG. 8(b), the mode information of the blocks is the same. Therefore, in FIG. 8(c), selectivity of an MPM (Most Probable Mode), where the prediction mode of the current block is derived from information on the neighbor block, is low compared to the case in FIG. 8(b). That is, as shown in FIG. 8(c), when an omni-directional prediction mode can be used, selectivity of the MPM is low, and thus the number of bits for signaling the prediction mode of the current block may be increased.

FIG. 9 is a view illustrating a method of configuring a reference pixel for intra prediction of the current block by using the pixel of the current picture and the pixel of the reference picture.

FIG. 9(a) illustrates a current picture containing a current block, and FIG. 9(b) illustrates a reference picture. As shown in FIG. 9(a), intra prediction for the current block may be performed using the pixel of the pre-reconstructed area adjacent to the current block in the current picture. Alternatively, as shown in FIG. 9(b), intra prediction for the current block may be performed using the pixel of an area (hereinafter referred to as "a similar area of the current block") similar to the current block in the reference picture. The similar area of the current block means an area in a reference picture having pixel distribution similar to that of the current block. In the description, the similar area of the current block may be indicated as the similar area in the reference picture. The similar area in the reference picture may be an area in the reference picture having the same size as the current block. Alternatively, the similar area in the reference picture may include an area in the reference picture having the same size as the current block and an area adjacent thereto.

The similar area in the reference picture may be specified based on prediction information (e.g., a motion vector) of the current block. Here, in order to estimate the prediction information of the current block, prediction information of the neighbor area of the current block may be utilized. Alternatively, the similar area in the reference picture may be searched for by comparing similarity between the neighbor pixel of the current block and the pixel in the reference picture. For example, in order to compare similarity, an area of an original picture corresponding to the current block and an arbitrary area in the reference picture are specified, and a cost may be calculated by applying various cost functions to the pixel values in the areas. As another example, in order to compare similarity, a neighbor area of the current block and an arbitrary area in the reference picture are specified, and a cost may be calculated by applying various cost functions to the pixel values in the areas. For example, the cost functions include SAD (sum of absolute differences), SSD (sum of squared differences), MAD (mean absolute difference), MSD (mean squared difference), etc. When a cost is calculated using the cost functions, whether two areas are similar may be determined by comparing the calculated cost with a threshold. Alternatively, the similar area in the reference picture may be determined by using prediction information (e.g., a motion vector) transmitted from the encoder to the decoder, the prediction information being information for determining an area having high similarity to the current block in the reference picture.
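The four cost functions named above, together with the thresholded similarity test, can be written down directly; the threshold value in this sketch is an arbitrary placeholder.

import numpy as np

# The cost functions named in the text.
def sad(a, b): return np.abs(a - b).sum()        # sum of absolute differences
def ssd(a, b): return ((a - b) ** 2).sum()       # sum of squared differences
def mad(a, b): return np.abs(a - b).mean()       # mean absolute difference
def msd(a, b): return ((a - b) ** 2).mean()      # mean squared difference

# Two areas are judged similar when the chosen cost is below a threshold.
def is_similar(area_a: np.ndarray, area_b: np.ndarray,
               cost_fn=sad, threshold: float = 1000.0) -> bool:
    a = area_a.astype(np.int64)
    b = area_b.astype(np.int64)
    return cost_fn(a, b) < threshold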

According to the present invention, intra prediction of the current block may be performed using the pixel of the similar area in the reference picture in addition to the pre-reconstructed area of the current picture. That is, intra prediction according to the present invention may refer to the pixel in the reference picture. Accordingly, intra prediction of the current block according to the present invention may be indicated as Inter_DC prediction, Inter_Planar prediction, or Inter_Angular prediction.

According to another embodiment of the present invention, the reference picture is transformed to generate a geometric transform picture (a warped picture), and the geometric transform picture is used as a reference picture to determine the similar area. Based on information on the geometric transform, the reference picture of the current picture is reconfigured to be similar to the current picture, such that the geometric transform picture is generated. The geometric transform picture may be used as a reference picture of inter prediction and of intra prediction according to the present invention. The geometric transform reference picture is a picture reconfigured to be similar to the current picture, and thus accuracy of prediction using the similar area pixel may increase.

The similar area of the current block is likely to be the area in the reference picture having the same position as the current block. Therefore, the area in the reference picture having the same position as the current block and/or an area adjacent thereto may be determined as the similar area of the current block.

FIG. 10 is a view illustrating operation of an encoder performing intra prediction according to the present invention.

At step S1010, intra prediction may be performed using, as a reference sample, a pixel of the pre-reconstructed area in the current picture, and encoding efficiency may be estimated. At step S1020, intra prediction may be performed using, as a reference sample, a pixel of the similar area in the reference picture, and encoding efficiency may be estimated. Inter_DC prediction, Inter_Planar prediction, and Inter_Angular prediction, which may be performed at step S1020, will be described later.

Next, the encoding efficiencies estimated at steps S1010 and S1020 are compared at step S1030, and intra prediction may be performed using the method with higher encoding efficiency at step S1040 or S1050.

At step S1040, additional information may be signaled from the encoder to the decoder, and the additional information includes at least one of prediction information containing a prediction mode, information on the similar area, and mode information indicating use of the similar area. The decoder decodes the signaled additional information to perform the intra prediction performed by the encoder. Information on the similar area may be derived equally by the encoder and the decoder by using pixel distribution of a neighbor area of the current block or prediction information such as a motion vector. In this case, information on the similar area may not be signaled.

In the example explained referring to FIG. 10, at steps S1020 and S1040, intra prediction is performed using a pixel of the similar area in the reference picture. However, the present invention is not limited thereto; for example, at steps S1020 and S1040, intra prediction may be performed using both the pixel of the similar area in the reference picture and the pixel of the pre-reconstructed area in the current picture.

FIG. 11 is a view illustrating operation of a decoder performing intra prediction according to the present invention.

At step S1110, the decoder may receive additional information from the encoder. The decoder may determine, based on the additional information, whether the current block is predicted using the pixel of the similar area in the reference picture or using the pixel of the pre-reconstructed area in the current picture at step S1120. Next, depending on the result of the determination at step S1120, the decoder may selectively perform step S1130 or step S1140. When the prediction mode of the current picture can be determined without receiving the additional information, step S1110 in FIG. 11 may be omitted.

Hereinafter, Inter_DC prediction according to the present invention will be described.

In a DC mode (hereinafter referred to as "intra DC" or "Intra_DC") using the pixel of the pre-reconstructed area in the current picture, the left and upper neighbor pixels of the current block are used. That is, the current block is predicted using an average value of the pixel values of the left and upper neighbor pixels. Therefore, when the left and upper neighbor pixels of the current block are similar to the pixel of the current block, accuracy of intra DC prediction is high. In contrast, when the left and upper neighbor pixels of the current block are greatly different from the pixel of the current block, accuracy of intra DC prediction is low. In the latter case, accuracy of prediction may be enhanced using the pixel of the similar area in the reference picture.

FIG. 12 is a view illustrating Inter_DC prediction according to the present invention.

FIG. 12(a) illustrates a current block and neighbor pixels in the current picture. In FIG. 12(a), the broken line areas are the areas of the left and upper neighbor pixels of the current block used in intra DC prediction. Also, the bold solid line area indicates the current block. FIG. 12(b) illustrates a pixel of the similar area in the reference picture. In FIG. 12(b), the bold solid line area indicates the similar area in the reference picture corresponding to the current block.

For Inter_DC prediction according to the present invention, all or at least some of the pixels of the similar area in the reference picture and the left and upper neighbor pixels of the current block in the current picture may be used as reference pixels. The pixels of the similar area in the reference picture used in Inter_DC prediction of the present invention may include the pixel of the bold solid line area and/or an area adjacent thereto in FIG. 12(b).

Inter_DC prediction may be performed in units of sub-blocks constituting the current block. The range of available pixels of the similar area in the reference picture may vary depending on the position of the sub-block. The size and/or shape of the sub-block may be a fixed size and/or shape predetermined by the encoder/decoder, or may be derived depending on the size and/or shape of the current block. A plurality of sub-blocks constituting the current block may be predicted sequentially according to a predetermined priority order. The predetermined priority order may be from left to right or from top to bottom. Alternatively, the predetermined priority order may be from right to left or from bottom to top. When a first sub-block in the current block is reconstructed before a second sub-block, a reconstructed pixel of the first sub-block may be used as a reference pixel for prediction of the second sub-block.

A prediction value of Inter_DC prediction may be derived by using an average or a weighted average of the available reference pixels. When applying the weighted average, a weighting for each pixel may be determined in various ways.
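A minimal sketch of the Inter_DC prediction value follows, assuming the reference pixels have already been gathered from the current picture and/or the similar area in the reference picture; the weighting scheme is left to the caller.

import numpy as np

# Inter_DC prediction value: the plain average of the available reference
# pixels, or a weighted average when per-pixel weights are supplied.
def inter_dc_value(reference_pixels, weights=None) -> float:
    pixels = np.asarray(reference_pixels, dtype=np.float64)
    if weights is None:
        return float(pixels.mean())
    w = np.asarray(weights, dtype=np.float64)
    return float((pixels * w).sum() / w.sum())

# Example: neighbor pixels from the current picture plus pixels from the
# similar area in the reference picture, the latter weighted higher.
pixels = [100, 104, 96, 120, 124]
weights = [1, 1, 1, 2, 2]
dc = inter_dc_value(pixels, weights)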

FIG. 13 is a view illustrating an embodiment of Inter_DC prediction according to the present invention.

At step S1310, whether the neighbor pixel of the current block in the current picture is available may be determined. The neighbor pixel may include, for example, a pixel adjacent to the left and/or the top of the current block. When the neighbor pixel is available, the available neighbor pixel may be added as a reference pixel candidate of the current block at step S1320.

When the neighbor pixel is unavailable, the pixel of the similar area in the reference picture may be added as a reference pixel candidate at step S1330. A method of specifying the similar area in the reference picture is the same as described above.

Next, a prediction value of the current block is calculated using the reference pixel candidates such that prediction may be performed at step S1340. The prediction value may be calculated using an average or a weighted average of all or some of the reference pixel candidates.

Also, among the pixels in the current block, filtering may be performed on a pixel adjacent to the boundary at step S1350. The boundary on which filtering is performed may include, for example, at least one of the left, top, right, and bottom boundaries of the current block. Filtering on the pixel adjacent to the left and/or top boundary of the current block may be performed using the pixel of the pre-reconstructed area in the current picture and/or the pixel of the similar area in the reference picture. Filtering on the pixel adjacent to the right and/or bottom boundary of the current block may be performed using the pixel of the similar area in the reference picture.

The embodiment explained referring to FIG. 13 may be effectively applied when, for example, there is no neighbor pixel to be referenced because the current block is adjacent to the boundary of a particular area. The particular area may be, for example, at least one of a picture, a slice, and a tile. Alternatively, the particular area may be an area that is a unit of parallel processing. When the area is a unit of parallel processing, the area is encoded so as not to refer to information on the neighbor area.

The embodiment explained referring to FIG. 13 is performed equally by the encoder and the decoder based on whether the neighbor pixel of the current block is available, and thus additional information required for prediction may not be transmitted.

In the embodiment explained referring to FIG. 13, the reference pixel is used differently after determining whether the neighbor pixel of the current block is available. However, regardless of whether the neighbor pixel of the current block is available, Inter_DC prediction may be performed using only the pixel of the similar area in the reference picture. In this case, information indicating that Inter_DC prediction has been performed (e.g., mode information) and/or information on the similar area (e.g., position information) may be signaled from the encoder to the decoder.

Also, in the embodiment explained referring to FIG. 13, only one of the neighbor pixel of the current block and the pixel of the similar area in the reference picture is referenced. Without being limited thereto, however, both the available neighbor pixel of the current block and the pixel of the similar area in the reference picture may be used as reference pixel candidates. Also, all or some of the reference pixel candidates may be used for prediction.

FIG. 14 is a view illustrating operation of an encoder performing another embodiment of DC prediction according to the present invention.

At step S1410, the encoder may search for the similar area of the current block in the reference picture. A method of specifying the similar area in the reference picture is the same as described above.

At step S1420, whether the pixel of the similar area is similar to the pixel of the current block may be determined. When pixel similarity between the two areas is high, Inter_DC prediction is performed at step S1430. When the pixel similarity is low, Intra_DC prediction is performed at step S1440. A method of comparing similarity between pixels is the same as described above.

In the embodiment explained referring to FIG. 14, the encoder and the decoder may perform the same process based on the similarity between areas, and thus information on a mode may not be transmitted.

At step S1420 in FIG. 14, similarity between the similar area in the reference picture and the current block is determined. However, without being limited thereto, similarity between the pre-reconstructed area and the current block in the current picture may be determined. For example, when similarity between the pre-reconstructed area and the current block in the current picture is high, Intra_DC prediction is performed. When the similarity is low, Inter_DC prediction is performed.

Alternatively, similarity in both cases may be determined. In this case, the similarities of the two cases are compared. When the current block is more similar to the pre-reconstructed area in the current picture than to the similar area in the reference picture, Intra_DC prediction is performed; otherwise, Inter_DC prediction is performed.

As another embodiment, the encoder may perform both Intra_DC prediction and Inter_DC prediction, and after comparing the encoding efficiency of the two prediction modes, the prediction having higher encoding efficiency may be selected. In this case, information on the prediction may be signaled from the encoder to the decoder. When Inter_DC prediction is selected, the information on the prediction may include information on the similar area and/or information on the selected mode.

FIG. 15 is a view illustrating Inter_Planar prediction according to the present invention.

FIG. 15(a) illustrates an example of planar prediction (hereinafter referred to as "Intra_Planar prediction") using only the pixel of the pre-reconstructed area in the current picture. FIG. 15(b) illustrates an example of Inter_Planar prediction using the pixel of the pre-reconstructed area in the current picture and/or the pixel of the similar area in the reference picture. In the case of Inter_Planar prediction, all or some of the pixels of the pre-reconstructed area in the current picture and of the similar area in the reference picture may be used in performing prediction.

In FIG. 15, the gray-colored pixel indicates a reference pixel that may be used in Intra_Planar prediction or Inter_Planar prediction. Also, the black-colored pixel indicates a pixel in the current block that is a target of prediction.

In Intra_Planar prediction, only the pixel of the pre-reconstructed area in the current picture (e.g., the left and/or upper neighbor pixel of the current block) is used in predicting the current block. Therefore, when similarity between the pixel of the pre-reconstructed area in the current picture and the pixel of the current block is low, accuracy of prediction decreases and encoding efficiency is low. In this case, the pixel of the similar area in the reference picture is used, such that accuracy of prediction may be enhanced.

In planar prediction, a weighting may be applied based on the distance between the reference pixel and a current prediction target pixel. For example, a higher weighting may be applied to a nearer reference pixel. Alternatively, different weightings may be applied to the reference pixel in the current picture and the reference pixel in the reference picture. For example, a higher weighting may be applied to the reference pixel having higher similarity to the current block. That is, when the current block is more similar to the similar area in the reference picture than to the pre-reconstructed area in the current picture, a higher weighting may be applied to the reference pixel in the reference picture.
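One way to realize the distance-based weighting described above is simple inverse-distance weighting; this particular decay is an illustrative assumption, not a rule taken from the text.

# Predict one sample as an inverse-distance weighted average of reference
# pixel values; nearer references contribute more, as described above.
def weighted_sample(ref_values, distances):
    weights = [1.0 / (1.0 + d) for d in distances]
    total = sum(weights)
    return sum(v * w for v, w in zip(ref_values, weights)) / total

# Example: a near reference (distance 1) dominates a far one (distance 7).
assert abs(weighted_sample([100, 200], [1, 7]) - 120.0) < 1e-9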

FIG. 16 is a view illustrating another embodiment of the reference pixel that may be used in Inter_Planar prediction.

As shown in FIG. 16, the reference pixel in the reference picture may be a pixel in a particular area including a current prediction target pixel.

According to an embodiment of a planar prediction method of the present invention, Intra_Planar prediction and Inter_Planar prediction are performed, and the encoding efficiencies of the two prediction modes may be compared. When the encoding efficiency of Intra_Planar prediction is higher, Intra_Planar prediction may be performed on the current block. When the encoding efficiency of Inter_Planar prediction is higher, Inter_Planar prediction may be performed on the current block. When Inter_Planar prediction is performed on the current block, information on the prediction may be signaled from the encoder to the decoder.

The embodiment of Inter_DC prediction explained referring to FIG. 14 may be applied to Inter_Planar prediction. That is, the similar area in the reference picture is searched for, and similarity between the similar area and the current block is determined. When the similarity is high, Inter_Planar prediction is performed. When the similarity is low, Intra_Planar prediction is performed.

FIG. 17 is a view illustrating another embodiment of Inter_Planar prediction according to the present invention.

At step S1710, the similar area in the reference picture may be searched for. A method of searching for the similar area is the same as described above. When the similar area is determined, similarity between the similar area and the current block may be determined at step S1720. A method of comparing similarity between pixels is the same as described above.

Based on the result of the similarity determination at step S1720, a first weighting may be applied to the pixel of the similar area in the reference picture, and a second weighting may be applied to the pixel of the pre-reconstructed area in the current picture (e.g., the left and/or upper neighbor pixel of the current block). The first weighting and the second weighting may be the same as or different from each other. For example, when similarity between the similar area and the current block is high, the first weighting may be larger than the second weighting at step S1730. Otherwise, the second weighting may be larger than the first weighting at step S1740. When the weightings are determined, Inter_Planar prediction may be performed based on the reference pixels and the weightings at step S1750.
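The branch at steps S1730/S1740 can be sketched as below; the 0.75/0.25 split and the cost threshold are illustrative placeholders, since the text does not fix concrete values.

# FIG. 17 sketch: choose the first weighting (reference-picture pixels) and
# the second weighting (current-picture neighbors) from a similarity cost.
def inter_planar_weights(similarity_cost: float, threshold: float):
    if similarity_cost < threshold:        # similar area matches well (S1730)
        return 0.75, 0.25                  # favor reference-picture pixels
    return 0.25, 0.75                      # favor current-picture neighbors (S1740)

first_w, second_w = inter_planar_weights(similarity_cost=420.0, threshold=1000.0)
assert (first_w, second_w) == (0.75, 0.25)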

Inter_Planar prediction explained referring to FIG. 17 may be performed equally by the encoder and the decoder based on the result of the similarity determination, and thus information on the prediction mode may not be signaled.

Hereinafter, Inter_Angular prediction according to the present invention will be described.

As described referring to FIG. 7, since intra prediction according to the present invention may refer to the pixel of the similar area in the reference picture, omni-directional prediction may be performed.

When pixel similarity between the pre-reconstructed area and the current block in the current picture is low, accuracy of prediction referring to the pixel of the pre-reconstructed area in the current picture may decrease. In this case, prediction is performed referring to the pixel of the similar area in the reference picture, such that accuracy of prediction may be enhanced. Since the pixel in the reference picture is already reconstructed at the time of prediction of the current block, omni-directional prediction is possible, as described later.

FIG. 18 is a view illustrating another embodiment of an omni-directional prediction mode according to the present invention. In FIG. 18, the direction of the arrow indicates the position of the reference pixel, and the number at the head of each arrow indicates a symbol (a mode number) of an intra-prediction direction.

As shown in FIG. 18, the same symbols may be assigned to 180-degree opposite prediction directions. As described above, in omni-directional prediction, when the number of symbols indicating prediction directions (prediction modes) is large, selectivity of the MPM is low and encoding efficiency of the prediction mode is likely to decrease. As shown in FIG. 18, the same symbols are assigned to opposite prediction directions, such that the problem of decreased encoding efficiency of the prediction mode caused by the increased number of symbols indicating prediction modes may be solved. In this case, one prediction direction may be specified based on information indicating whether the pixel of the pre-reconstructed area in the current picture (e.g., the pixel of the left or upper area) is used or the pixel at the position of the unreconstructed area (e.g., the pixel of the right or lower area) is used. For example, the 18th symbol may indicate a left-upward diagonal direction or a right-downward diagonal direction. When the pixel at the position of the unreconstructed area is used as the reference pixel of the current block, the 18th symbol may be specified as the right-downward diagonal direction.
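The shared-symbol scheme can be sketched as a symbol plus a one-bit side selector; the angle mapping below is purely hypothetical and only illustrates how one symbol covers two opposite directions.

# FIG. 18 sketch: one mode symbol covers two 180-degree opposite directions.
# A side flag resolves it: False points toward the pre-reconstructed
# (left/upper) area, True toward the unreconstructed (right/lower) area.
def resolve_direction(mode_symbol: int, use_unreconstructed_side: bool) -> float:
    base_angle = mode_symbol * 5.0           # hypothetical symbol-to-angle mapping
    if use_unreconstructed_side:
        return (base_angle + 180.0) % 360.0  # opposite direction, same symbol
    return base_angle

# The same symbol yields two opposite directions depending on the flag.
assert resolve_direction(18, True) == (resolve_direction(18, False) + 180.0) % 360.0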

FIG. 19 is a view illustrating directionality prediction referring to the pixel of the similar area in the reference picture.

FIGS. 19(a) and 19(c) are views illustrating directionality intra prediction (hereinafter referred to as "Intra_Angular prediction") referring only to the pixel of the pre-reconstructed area in the current picture. FIGS. 19(b) and 19(d) are views illustrating directionality intra prediction (Inter_Angular prediction) referring to the pixel of the pre-reconstructed area in the current picture and the pixel of the similar area in the reference picture.

In FIG. 19, the dot-patterned area means the pre-reconstructed area (e.g., the left and/or upper neighbor area of the current block) in the current picture. The slash-patterned area means the pixel area in the reference picture (e.g., the similar area in the reference picture), and the gray-colored area means an area having no pixel information that may be referenced. The solid line arrow indicates a direction for prediction referring to the pixel of the pre-reconstructed area in the current picture, and the bold solid line arrow indicates a direction for prediction referring to the pixel in the reference picture. The broken line arrow indicates a direction where prediction is impossible.

As shown in FIG. 19(a), in Intra_Angular prediction, since only the pixel of the pre-reconstructed area in the current picture is referenced, the left and/or upper neighbor pixel of the current block may be referenced. However, as shown in FIG. 19(c), prediction may not be performed on an area that is not reconstructed yet.

In Inter_Angular prediction, since the pixel of the pre-reconstructed area in the current picture and the pixel in the reference picture are referenced, omni-directional prediction is possible, as shown in FIGS. 19(b) and (d).

The pixels in the reference picture that may be used in Inter_Angular prediction of the present invention may be all or some of the pixels of the similar area in the reference picture.

FIG. 20 is a view illustrating Inter_Angular prediction according to another embodiment of the present invention. As shown in FIG. 20, the pixel of the similar area in the reference picture and all or some of its neighbor pixels may be used in Inter_Angular prediction.

The encoder performing Inter_Angular prediction of the present invention may perform both Intra_Angular prediction and Inter_Angular prediction and may measure encoding efficiency. Next, the encoding efficiencies of both predictions are compared, and the prediction having better encoding efficiency may be performed. Alternatively, a weighted sum of Intra_Angular prediction and Inter_Angular prediction may be used in prediction.

After prediction, filtering may be performed on the current block. Filtering for the current block may include first filtering for the left and/or top boundary area of the current block and second filtering for the right and/or bottom boundary area of the current block. The first filtering may be performed based on at least one of the pixel of the pre-reconstructed area in the current picture (the left and/or upper neighbor pixel of the current block) and the pixel of the reference picture. The second filtering may be performed based on the pixel of the pre-reconstructed area in the current picture (the left and/or upper neighbor pixel of the current block) or the pixel of the reference picture. The first filtering and the second filtering may be different in at least one of the position, the number, and the range of pixels used in filtering.

FIG. 21 is a view illustrating operation of a decoder performing angular prediction according to the present invention.

The encoder according to the present invention may signal, to the decoder, prediction information indicating that Inter_Angular prediction has been performed. The decoder receives the prediction information at step S2110, determines whether Inter_Angular prediction or Intra_Angular prediction is performed on the current block at step S2120, and performs Inter_Angular prediction at step S2130 or Intra_Angular prediction at step S2140.

FIG. 22 is a view illustrating a filtering method using the pixel of the similar area in the reference picture.

In FIG. 22, the bold solid line square area means the area of the current block. The horizontal line patterned area means the left and/or upper neighbor pixel of the current block referenced in filtering, and the vertical line patterned area means the right and/or lower neighbor pixel of the similar area in the reference picture referenced in filtering. The gray-colored area means an area where filtering is performed.

FIG. 22(a) is a view illustrating filtering using only the pixel in the current picture, as a filtering method performed in Intra_DC prediction. FIG. 22(c) is a view illustrating filtering using only the pixel in the current picture, as a filtering method performed in Intra_Angular prediction. As shown in FIGS. 22(a) and (c), only the pixel of the pre-reconstructed area in the current picture (e.g., the left and/or upper neighbor pixel of the current block) is used in performing filtering.

In FIGS. 22(b) and (d), filtering may be performed by referring to both the pixel of the pre-reconstructed area in the current picture (e.g., the left and/or upper neighbor pixel of the current block) and the pixel of the similar area in the reference picture (e.g., the right and/or lower neighbor pixel of the current block).

FIG. 22(b) illustrates a method of performing filtering only on the boundary area of the current block. In this case, filtering for the left and/or top boundary area of the current block may be performed using the left and/or upper neighbor pixel of the current block. Also, filtering for the right and/or bottom boundary area of the current block may be performed using the right and/or lower neighbor pixel of the similar area in the reference picture.

FIG. 22(d) illustrates a method of performing filtering on all inner areas of the current block. In this case, the left neighbor pixel and the upper neighbor pixel of the current block and all or some of the pixels of the similar area in the reference picture may be used. Different weightings may be assigned to the reference pixels used in filtering. The weighting may be determined based on a distance between a filtering target pixel and a reference pixel. Alternatively, depending on whether the reference pixel is in the current picture or in the reference picture, different weightings may be assigned.
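A sketch of one such weighting rule follows; the inverse-distance decay and the 0.5 down-scale for reference-picture pixels are illustrative assumptions, not values given in the text.

# Blend a predicted pixel with one reference pixel. The blend weight decays
# with the distance between the filtering target and the reference, and is
# optionally scaled down when the reference lies in the reference picture.
def filter_pixel(pred_value: float, ref_value: float,
                 distance: int, ref_in_reference_picture: bool) -> float:
    w = 1.0 / (1.0 + distance)
    if ref_in_reference_picture:
        w *= 0.5
    return (1.0 - w) * pred_value + w * ref_value

# Example: a boundary pixel filtered against a current-picture neighbor
# at distance 1 (blend weight 0.5).
filtered = filter_pixel(pred_value=128.0, ref_value=100.0,
                        distance=1, ref_in_reference_picture=False)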

Filtering explained with reference to FIGS. 22(b) and (d) may be performed on a block where Inter_DC prediction, Inter_Planar prediction, or Inter_Angular prediction is performed. However, at least one of the four types of filtering explained with reference to FIG. 22 may be applied to a block where Intra_DC prediction, Inter_DC prediction, Intra_Planar prediction, Inter_Planar prediction, Intra_Angular prediction, or Inter_Angular prediction is performed.

FIG. 23 is a view illustrating an embodiment of an encoder performing prediction using the pixel of the similar area in the reference picture.

The encoder shown in FIG. 23 according to the present invention may include a transform unit 2301, a quantization unit 2302, an entropy encoder 2303, an inverse quantization unit 2304, an inverse transform unit 2305, a deblocking filter 2306, a sample adaptive offset filter 2307, a decoded picture buffer (DPB) 2308, an intra-prediction unit 2309, an inter-prediction unit 2310, and/or a similar area pixel prediction unit 2311.

The similar area pixel prediction unit 2311 receives the reference picture from the decoded picture buffer 2308 and performs prediction on the current picture. When the current picture is an inter picture, the similar area pixel prediction unit 2311 may perform Inter_DC prediction, Inter_Planar prediction, and/or Inter_Angular prediction by referring to the pixel of the similar area in the reference picture. In FIG. 23, the similar area pixel prediction unit 2311 is shown as a separate component, but may be included in the intra-prediction unit 2309 or the inter-prediction unit 2310. Alternatively, a function of the similar area pixel prediction unit 2311 may be realized by extending the configuration of the intra-prediction unit 2309 or the inter-prediction unit 2310.

FIG. 24 is a view illustrating another embodiment of an encoder performing prediction using the pixel of the similar area in the reference picture.

The encoder shown in FIG. 24 according to the present invention may include a transform unit 2401, a quantization unit 2402, an entropy encoder 2403, an inverse quantization unit 2404, an inverse transform unit 2405, a deblocking filter 2406, a sample adaptive offset filter 2407, a decoded picture buffer (DPB) 2408, an intra-prediction unit 2409, an inter-prediction unit 2410, a similar area pixel prediction unit 2411, a geometric transform picture generation unit 2412, and/or a geometric transform picture buffer 2413.

The similar area pixel prediction unit 2411 receives a reference picture and/or a geometric-transform reference picture from the decoded picture buffer 2408 and/or the geometric transform picture buffer 2413, respectively, and performs prediction on the current picture. When the current picture is an inter picture, the similar area pixel prediction unit 2411 may perform Inter_DC prediction, Inter_Planar prediction, and/or Inter_Angular prediction by referring to the pixel of the similar area in the reference picture and/or the geometric-transform reference picture. In FIG. 24, the similar area pixel prediction unit 2411 is shown as a separate component, but may be included in the intra-prediction unit 2409 or the inter-prediction unit 2410. Alternatively, a function of the similar area pixel prediction unit 2411 may be realized by extending the configuration of the intra-prediction unit 2409 or the inter-prediction unit 2410.

FIG. 25 is a view illustrating an embodiment of a decoder performing prediction using the pixel of the similar area in the reference picture.

The decoder shown in FIG. 25 according to the present invention may include an entropy decoder 2501, an inverse quantization unit 2502, an inverse transform unit 2503, a filter unit 2504, an intra-prediction unit 2505, an inter-prediction unit 2506, a decoded picture buffer 2507, and/or a similar area pixel prediction unit 2508.

The similar area pixel prediction unit 2508 receives a reference picture from the decoded picture buffer 2507 and performs prediction on the current picture. The similar area pixel prediction unit 2508 may perform, based on prediction information received from the encoder, Inter_DC prediction, Inter_Planar prediction, and/or Inter_Angular prediction referring to the pixel of the similar area in the reference picture. In FIG. 25, the similar area pixel prediction unit 2508 is shown as a separate component, but may be included in the intra-prediction unit 2505 or the inter-prediction unit 2506. Alternatively, a function of the similar area pixel prediction unit 2508 may be realized by extending the configuration of the intra-prediction unit 2505 or the inter-prediction unit 2506.

FIG. 26 is a view illustrating another embodiment of a decoder performing prediction using the pixel of the similar area in the reference picture.

The decoder shown in FIG. 26 according to the present invention may include an entropy decoder 2601, an inverse quantization unit 2602, an inverse transform unit 2603, a filter unit 2604, an intra-prediction unit 2605, an inter-prediction unit 2606, a decoded picture buffer 2607, a similar area pixel prediction unit 2608, a geometric transform picture generation unit 2609, and/or a geometric transform picture buffer 2610.

The similar area pixel prediction unit 2608 receives a reference picture and/or a geometric-transform reference picture from the decoded picture buffer 2607 and/or the geometric transform picture buffer 2610, respectively, and performs prediction on the current picture. The similar area pixel prediction unit 2608 may perform, based on prediction information received from the encoder, Inter_DC prediction, Inter_Planar prediction, and/or Inter_Angular prediction referring to the reference picture and/or the pixel of the similar area in the geometric-transform reference picture. In FIG. 26, the similar area pixel prediction unit 2608 is shown as a separate component, but may be included in the intra-prediction unit 2605 or the inter-prediction unit 2606.

FIG. 27 is a view illustrating an embodiment of an apparatus for performing Inter_DC prediction in a similar area pixel prediction unit according to the present invention.

The similar area pixel prediction unit may include an Inter_DC prediction unit performing Inter_DC prediction. As shown in FIG. 27, the Inter_DC prediction unit may include a reference sample configuration unit 2701, a prediction value generation unit 2702, an Inter_DC prediction execution unit 2703, and/or a filter unit 2704.

The reference sample configuration unit 2701 receives the current picture, the reference picture, and/or the current block as input and configures a reference sample required in Inter_DC prediction. Information on the current block input to the reference sample configuration unit 2701 may be, for example, the position of the current block and/or the size of the current block in the current picture. The reference picture may be a picture reconstructed before the current picture and/or a geometric transform picture of a reference picture generated based on geometric transform information. The reference sample configuration unit 2701 determines the similar area in the reference picture, and configures the reference sample by using the pixel of the pre-reconstructed area in the current picture (e.g., the left and/or upper neighbor pixel of the current block) and the pixel of the similar area in the reference picture. The reference sample configuration unit 2701 may perform reference sample filtering if necessary.

The prediction value generation unit 2702 may calculate a prediction value based on the configured reference samples. In order to calculate the prediction value, an average or a weighted average of the configured reference samples may be used. The Inter_DC prediction execution unit 2703 may perform Inter_DC prediction on the current block based on the calculated prediction value.

The filter unit 2704 may perform filtering on an Inter_DC predicted block. For example, filtering may be performed on the boundary area or an inner area of the current block. For filtering, the pixel of the pre-reconstructed area in the current picture (e.g., the left and/or upper neighbor pixel of the current block), the pixel of the similar area in the reference picture, and/or the pixel of the predicted current block may be used.

The Inter_DC prediction unit may output the finally predicted current block and/or prediction information. The predicted current block may be a prediction block generated using the reference samples. The prediction information includes information generated through Inter_DC prediction, for example, information on a residual signal and/or on the similar area.

FIG. 28 is a view illustrating an embodiment of an apparatus for performing Inter_Planar prediction in a similar area pixel prediction unit according to the present invention.

The similar area pixel prediction unit may include an Inter_Planar prediction unit performing Inter_Planar prediction. As shown in FIG. 28, the Inter_Planar prediction unit may include a reference sample configuration unit 2801, an Inter_Planar prediction execution unit 2802, and/or a filter unit 2803. The reference sample configuration unit 2801 may perform reference sample filtering if necessary.

In FIGS. 27 and 28, the description of the common inputs, outputs, and configuration units is the same, and the differences will mainly be described below.

The prediction information, which is output of Inter_Planar prediction, includes information generated through Inter_Planar prediction, for example, information on a residual signal, on the similar area, and/or on a weighting.

The Inter_Planar prediction execution unit 2802 may predict the current block by using the reference samples received from the reference sample configuration unit 2801. For Inter_Planar prediction, all pixels in the reference picture may be used as reference samples.

FIG. 29 is a view illustrating an embodiment of an apparatus forperforming Inter_Angular prediction in a similar area pixel predictionunit according to the present invention.

The similar area pixel prediction unit may include an Inter_Angularprediction unit performing Inter_Angular prediction. As shown in FIG.29, the Inter_Angular prediction unit may include a reference sampleconfiguration unit 2901, a reference sample filter unit 2902, anInter_Angular prediction execution unit 2903, and/or a filter unit 2904.

In FIGS. 27 and 29, description of common input, output, and aconfiguration unit is the same, and difference will be mainly describedbelow.

The prediction information, which is output of Inter_Angular prediction, includes information generated through Inter_Angular prediction, for example, information on a residual signal, on the similar area, and/or on a prediction direction.

The reference sample configuration unit 2901 may use all pixels in the reference picture and/or the geometric-transform reference picture as reference samples. Therefore, reference samples may be configured using all or some of the pixels of the reference picture, and/or all or some of the right and lower pixels of the current block in the geometric-transform reference picture, and/or all or some of the pixels in the current block.

The reference sample filter unit 2902 may perform filtering on the reference sample configured by the reference sample configuration unit 2901 so as to remove a high-frequency component existing in the configured reference sample. The reference sample filter unit 2902 may perform filtering on the reference sample by using all or some of the pixels in the current picture and/or the geometric-transform reference picture.
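The present disclosure does not fix the filter used to suppress high-frequency components, so the sketch below assumes a simple [1, 2, 1]/4 smoothing filter applied along a line of configured reference samples; the function name and the end-sample handling are illustrative assumptions.

```python
import numpy as np

def smooth_reference_samples(ref_line):
    # Replace each interior sample with a rounded [1, 2, 1]/4 weighted
    # average of itself and its two neighbors; keep the end samples.
    ref = np.asarray(ref_line, dtype=np.int64)
    out = ref.copy()
    out[1:-1] = (ref[:-2] + 2 * ref[1:-1] + ref[2:] + 2) >> 2
    return out

print(smooth_reference_samples([100, 140, 90, 135, 95]))  # [100 118 114 114 95]
```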

Table 1 is an example of a syntax structure of a coding unit (CU) for signaling whether Inter_DC prediction has been performed.

TABLE 1

  coding_unit(x0, y0, log2CbSize) {               Descriptor
    . . .
    Inter_DC_flag                                 ae(v)
    if( Inter_DC_flag ) {
      Inter_DC_prediction( ... )
    . . .

Table 2 is an example of a syntax structure of a coding unit (CU) for signaling whether Inter_Planar prediction has been performed.

TABLE 2

  coding_unit(x0, y0, log2CbSize) {               Descriptor
    . . .
    Inter_Planar_flag                             ae(v)
    if( Inter_Planar_flag ) {
      Inter_Planar_prediction( ... )
    . . .

Table 3 is an example of a syntax structure of a coding unit (CU) for signaling whether Inter_Angular prediction has been performed.

TABLE 3

  coding_unit(x0, y0, log2CbSize) {               Descriptor
    . . .
    Inter_Angular_flag                            ae(v)
    if( Inter_Angular_flag ) {
      Inter_Angular_prediction( ... )
    . . .

Table 4 is an example of a syntax structure of a prediction unit (PU) for signaling whether Inter_DC prediction has been performed.

TABLE 4

  prediction_unit(x0, y0, nPbW, nPbH) {           Descriptor
    . . .
    Inter_DC_flag                                 ae(v)
    if( Inter_DC_flag ) {
      Inter_DC_prediction( ... )
    . . .

Table 5 is an example of a syntax structure of a prediction unit (PU) for signaling whether Inter_Planar prediction has been performed.

TABLE 5

  prediction_unit(x0, y0, nPbW, nPbH) {           Descriptor
    . . .
    Inter_Planar_flag                             ae(v)
    if( Inter_Planar_flag ) {
      Inter_Planar_prediction( ... )
    . . .

Table 6 is an example of a syntax structure of a prediction unit (PU) for signaling whether Inter_Angular prediction has been performed.

TABLE 6

  prediction_unit(x0, y0, nPbW, nPbH) {           Descriptor
    . . .
    Inter_Angular_flag                            ae(v)
    if( Inter_Angular_flag ) {
      Inter_Angular_prediction( ... )
    . . .

Table 7 is an example of a syntax structure of a coding unit for signaling whether prediction using the pixel of the similar area in the reference picture has been performed.

TABLE 7

  coding_unit(x0, y0, log2CbSize) {               Descriptor
    . . .
    Using_SimilarArea_Refer_Mode_flag             ae(v)
    if( Using_SimilarArea_Refer_Mode_flag ) {
      SimilarArea_Refer_prediction( ... )
    } else {
      Intra_prediction( ... )
    . . .

Table 8 is an example of a syntax structure of a prediction unit for signaling whether prediction using the pixel of the similar area in the reference picture has been performed.

TABLE 8

  prediction_unit(x0, y0, nPbW, nPbH) {           Descriptor
    . . .
    Using_SimilarArea_Refer_Mode_flag             ae(v)
    if( Using_SimilarArea_Refer_Mode_flag ) {
      SimilarArea_Refer_prediction( ... )
    } else {
      Intra_prediction( ... )
    . . .

As shown in Tables 1 and 4, information (Inter_DC_flag) indicating whether Inter_DC prediction is performed on the current block may be signaled at the CU and/or PU level. The decoder may parse the Inter_DC_flag included in the CU or the PU, and may determine whether Inter_DC prediction is performed on the current block. For example, when the Inter_DC_flag indicates true, Inter_DC prediction may be performed on the current block. When the Inter_DC_flag indicates false, Intra_DC prediction may be performed on the current block.

As shown in Tables 2 and 5, information (Inter_Planar_flag) indicating whether Inter_Planar prediction is performed on the current block may be signaled at the CU and/or PU level. The decoder may parse the Inter_Planar_flag included in the CU or the PU, and may determine whether Inter_Planar prediction is performed on the current block. For example, when the Inter_Planar_flag indicates true, Inter_Planar prediction is performed on the current block. When the Inter_Planar_flag indicates false, Intra_Planar prediction is performed on the current block.

As shown in Tables 3 and 6, information (Inter_Angular_flag) indicating whether Inter_Angular prediction is performed on the current block may be signaled at the CU and/or PU level. The decoder may parse the Inter_Angular_flag included in the CU or the PU, and may determine whether Inter_Angular prediction is performed on the current block. For example, when the Inter_Angular_flag indicates true, Inter_Angular prediction is performed on the current block. When the Inter_Angular_flag indicates false, Intra_Angular prediction is performed on the current block.

As shown in Tables 7 and 8, according to the present invention, information (Using_SimilarArea_Refer_Mode_flag) indicating whether prediction using the pixel of the similar area in the reference picture is performed may be signaled at the CU and/or PU level. In Tables 7 and 8, an algorithm and/or a syntax structure according to the present invention may be defined as SimilarArea_Refer_prediction( ). The decoder may parse the Using_SimilarArea_Refer_Mode_flag, and may determine whether prediction using the pixel of the similar area in the reference picture is performed on the current block. For example, when the Using_SimilarArea_Refer_Mode_flag indicates true, the SimilarArea_Refer_prediction( ) may be decoded and prediction using the pixel of the similar area in the reference picture may be performed. When the Using_SimilarArea_Refer_Mode_flag indicates false, prediction using the pixel of the pre-reconstructed area in the current picture may be performed.
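The decoder-side decision implied by Tables 1 to 8 can be summarized as the small sketch below. It models only the branch taken once a flag has been entropy-decoded; the function name and string labels are illustrative and are not part of the signaled syntax.

```python
def select_mode(flag_name: str, flag_value: bool) -> str:
    # Tables 1-6: a true Inter_X_flag selects Inter_X prediction; a false
    # flag falls back to the corresponding Intra_X prediction.
    if flag_name in ("Inter_DC_flag", "Inter_Planar_flag", "Inter_Angular_flag"):
        base = flag_name[len("Inter_"):-len("_flag")]  # "DC", "Planar", "Angular"
        return f"Inter_{base}" if flag_value else f"Intra_{base}"
    # Tables 7-8: the similar-area flag selects between prediction from the
    # similar area in the reference picture and ordinary intra prediction.
    if flag_name == "Using_SimilarArea_Refer_Mode_flag":
        return "SimilarArea_Refer_prediction" if flag_value else "Intra_prediction"
    raise ValueError(flag_name)

assert select_mode("Inter_DC_flag", False) == "Intra_DC"
assert select_mode("Using_SimilarArea_Refer_Mode_flag", True) == "SimilarArea_Refer_prediction"
```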

Tables 1 to 8 show the cases of signaling information (a flag) on a prediction mode using the pixel of the similar area in the reference picture. However, as described above, the information on the prediction mode may not be signaled when the encoder and the decoder can derive the prediction mode of the current block through the same method without signaling the information.

Hereinafter, among the methods of configuring a reference sample of intra prediction, reference sample padding and reference sample filtering will be described in detail.

FIG. 30 is a view illustrating reference sample padding.

In FIG. 30(a), a reference pixel required in intra prediction of the current block does not exist. In this case, as shown in FIG. 30(d), reference sample padding may be performed with a value of 1<<(bit depth−1). For example, when all reference pixels required in intra prediction of the current block do not exist in an 8-bit image, all reference samples may be padded using a pixel value of 1<<(8−1)=128.

In FIG. 30(b), some reference pixels required in intra prediction of the current block do not exist. In this case, as shown in FIG. 30(e), the reference samples may be padded using a reference pixel value of the area closest to the area where a reference pixel does not exist.

As shown in FIG. 30(c), when all reference pixels required in intra prediction of the current block exist, reference sample padding may not be performed, as shown in FIG. 30(f).
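A minimal sketch of these intra reference sample padding rules is given below, with unavailable samples marked as None. The two-pass nearest-copy is an assumed simplification; an actual codec scans the reference line in a fixed order.

```python
def intra_pad_reference(ref, bit_depth=8):
    # FIG. 30(a)/(d): nothing available -> fill with 1 << (bit_depth - 1).
    if all(s is None for s in ref):
        return [1 << (bit_depth - 1)] * len(ref)  # 128 for an 8-bit image
    out = list(ref)
    # FIG. 30(b)/(e): copy the value of the closest available sample.
    for i in range(1, len(out)):                  # forward pass
        if out[i] is None:
            out[i] = out[i - 1]
    for i in range(len(out) - 2, -1, -1):         # backward pass
        if out[i] is None:
            out[i] = out[i + 1]
    return out                                    # FIG. 30(c)/(f): unchanged

print(intra_pad_reference([None, None, 100, 102, None]))  # [100, 100, 100, 102, 102]
```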

FIG. 31 is a view illustrating reference sample filtering.

In FIG. 31, the bold broken line area is a reference sample area. FIG. 31(a) illustrates pixels of a reference sample before filtering the reference sample, and FIG. 31(b) illustrates pixels of a filtered reference sample. In FIG. 31(a), since the pixels in the reference sample area use reconstructed pixel values rather than original pixel values, errors may occur during the reconstruction process. Therefore, by performing filtering between reference samples, such as linear filtering or bilinear filtering, errors contained in the reference sample may be mitigated, as shown in FIG. 31(b).

When all or some reference samples do not exist and reference sample padding is performed, if the similarity between the original pixel of the area where a reference sample does not exist and the pixel used in reference sample padding is low, the accuracy of intra prediction may decrease. Considering this case, reference sample padding may be performed using the pixel of the reference picture.

FIG. 32 is a view illustrating a method of performing reference sample padding using the pixel of the reference picture when some reference pixels of intra prediction do not exist. In the example shown in FIG. 32, reference sample padding is performed on the lower left and upper right areas of the current block. However, without being limited thereto, for example, reference sample padding may be performed on at least one of the left, lower left, upper left, upper, and upper right areas.

FIG. 33 is a view illustrating a method of performing reference sample padding using the pixel of the reference picture when all reference pixels of intra prediction do not exist.

As shown in FIG. 32(a) and FIG. 33(a), when a reference pixel required in intra prediction of the current block does not exist, the reference pixel that does not exist may be padded using the pixel of the reference picture. For example, the similar area in the reference picture is specified, and the pixel in the reference picture, such as a neighbor pixel adjacent to the specified similar area and/or the pixel in the similar area, may be used so as to perform reference sample padding. The specifying of the similar area in the reference picture may be performed by the above-described method. That is, the similar area in the reference picture may be specified by using a motion vector or pixel distribution near the current block, or encoding/decoding information on the position of the similar area. Also, the geometric-transform reference picture may be used instead of the reference picture.

FIG. 32(b) and FIG. 33(b) illustrate reference pixels after reference sample padding using the pixel of the reference picture. A reference pixel closer to the original image may be generated and the accuracy of prediction may be enhanced, compared to the case where padding is performed using the closest available pixel value or a bit depth-based pixel value.

As explained above with reference to FIGS. 32 and 33, a process of padding a reference sample required in intra prediction by using the pixel of the reference picture is referred to as “inter reference sample padding” below. In contrast, the reference sample padding described with reference to FIG. 30 is referred to as “intra reference sample padding” below.

Reference sample padding according to the present invention may be adaptively performed depending on the type of the current picture. For example, when the current picture is an inter picture, inter reference sample padding may be performed. When the current picture is not the inter picture, intra reference sample padding may be performed.

Also, reference sample padding according to the present invention may be performed selectively as one of inter reference sample padding and intra reference sample padding. Alternatively, reference sample padding may be performed in a combination thereof.

FIG. 34 is a view illustrating an embodiment of reference sample padding according to the present invention.

First, at step S3410, a reference sample may be configured. At step S3420, whether an area where a reference sample does not exist, i.e., an unavailable reference sample, exists may be determined. When the unavailable reference sample exists, whether the current picture is an inter picture may be determined at step S3430. When the current picture is the inter picture, the similar area in the reference picture is searched at step S3440 and inter reference sample padding is performed at step S3450. When the current picture is not the inter picture, intra reference sample padding is performed at step S3460.

When the unavailable reference sample does not exist or reference sample padding is completed, reference sample filtering is performed at step S3470, and then, intra prediction is performed at step S3480.
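The flow of FIG. 34 can be written compactly as below; the function returns the ordered list of steps taken in a given situation. This is a sketch of the control flow only; the step labels are those of FIG. 34.

```python
def fig34_steps(has_unavailable_sample: bool, is_inter_picture: bool):
    steps = ["S3410 configure reference sample"]
    if has_unavailable_sample:                      # S3420
        if is_inter_picture:                        # S3430
            steps += ["S3440 search similar area",
                      "S3450 inter reference sample padding"]
        else:
            steps += ["S3460 intra reference sample padding"]
    steps += ["S3470 reference sample filtering",
              "S3480 intra prediction"]
    return steps

assert fig34_steps(False, True) == [
    "S3410 configure reference sample",
    "S3470 reference sample filtering",
    "S3480 intra prediction"]
```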

FIG. 35 is a view illustrating reference sample filtering using the pixel of the reference picture.

FIG. 35(a) illustrates an example of pixels in the reference picture used in reference sample filtering.

FIG. 35(b) illustrates an example of a reference sample after reference sample filtering.

As shown in FIG. 35(a), the similar area in the reference picture is specified, and the pixel in the reference picture, such as a neighbor pixel adjacent to the specified similar area and/or the pixel in the similar area, is used so as to perform reference sample filtering. The specifying of the similar area in the reference picture may be performed by the above-described method. Also, the geometric-transform reference picture may be used instead of the reference picture.

As described above with reference to FIG. 35, a process of filtering the reference sample required in intra prediction by using the pixel of the reference picture is referred to as “inter reference sample filtering” below. In contrast, the reference sample filtering described with reference to FIG. 31 is referred to as “intra reference sample filtering” below.

FIG. 36 is a view illustrating an embodiment of reference sample filtering according to the present invention.

First, at step S3610, the reference sample may be configured. At step S3620, whether the unavailable reference sample exists may be determined. When the unavailable reference sample exists, whether the current picture is the inter picture is determined at step S3630. Step S3630 is optional and may be omitted. When the current picture is the inter picture, the similar area in the reference picture is searched at step S3640, inter reference sample padding at step S3650 and inter reference sample filtering at step S3660 are performed, and intra prediction is performed at step S3690. When the current picture is not the inter picture, intra reference sample padding at step S3670 and intra reference sample filtering at step S3680 are performed, and intra prediction is performed at step S3690.

Filters used in inter reference sample filtering and intra reference sample filtering may be the same or different. A different filter means that at least one of the number of filter taps, a filter coefficient, a filtering direction, and a filter type differs. One of the filters may be a bidirectional filter in the vertical and horizontal directions. Also, only one of inter reference sample padding at step S3650 and inter reference sample filtering at step S3660 may be performed. For example, when the unavailable reference sample exists, intra reference sample padding may be performed; when the current picture is the inter picture, searching of the similar area in the reference picture and inter reference sample filtering may be performed.

Alternatively, after inter reference sample padding, intra reference sample filtering may or may not be performed.

FIG. 37 is a view illustrating an embodiment of an encoder that performs inter reference sample padding and/or inter reference sample filtering according to the present invention.

The encoder shown in FIG. 37 according to the present invention may include a transform unit 3701, a quantization unit 3702, an entropy encoder 3703, an inverse quantization unit 3704, an inverse transform unit 3705, a deblocking filter 3706, a sample adaptive offset filter 3707, a decoded picture buffer (DPB) 3708, an inter reference sample configuration unit 3709, an intra-prediction unit 3710, and/or an inter-prediction unit 3711.

The inter reference sample configuration unit 3709 receives the reference picture from the decoded picture buffer 3708 and configures the reference sample required in prediction of the current block. When the current picture is the inter picture, the inter reference sample configuration unit 3709 may configure a reference sample by referring to the pixel of the similar area in the reference picture. The inter reference sample configuration unit 3709 may be combined with various application encoders, such as a scalable video encoder, an encoder capable of parallel processing, etc., and may be provided in a structure different from the structure shown in FIG. 37 depending on the application order.

FIG. 38 is a view illustrating another embodiment of an encoder that performs inter reference sample padding and/or inter reference sample filtering according to the present invention.

The encoder shown in FIG. 38 according to the present invention may include a transform unit 3801, a quantization unit 3802, an entropy encoder 3803, an inverse quantization unit 3804, an inverse transform unit 3805, a deblocking filter 3806, a sample adaptive offset filter 3807, a decoded picture buffer (DPB) 3808, an inter reference sample configuration unit 3809, an intra-prediction unit 3810, an inter-prediction unit 3811, a transform picture generation unit 3812, and/or a transform picture buffer 3813.

The inter reference sample configuration unit 3809 receives the reference picture and/or the transform picture from the decoded picture buffer 3808 and/or the transform picture buffer 3813, and configures the reference sample required in prediction. When the current picture is the inter picture, the inter reference sample configuration unit 3809 may configure the reference sample by referring to the pixel of the similar area in the reference picture and/or the transform picture. The inter reference sample configuration unit 3809 may be combined with various application encoders, such as a scalable video encoder, an encoder capable of parallel processing, etc., and may be provided in a structure different from the structure shown in FIG. 38 depending on the application order.

Information on whether inter reference sample padding and/or filtering is applied and/or prediction information according to the present invention may be signaled from the encoder to the decoder.

FIG. 39 is a view illustrating an embodiment of a decoder that performs inter reference sample padding and/or inter reference sample filtering according to the present invention.

The decoder shown in FIG. 39 according to the present invention may include an entropy decoder 3901, an inverse quantization unit 3902, an inverse transform unit 3903, a filter unit 3904, an intra-prediction unit 3905, an inter-prediction unit 3906, a decoded picture buffer 3907, and/or an inter reference sample configuration unit 3908.

The inter reference sample configuration unit 3908 receives the reference picture from the decoded picture buffer 3907, and configures the reference sample required in prediction in the same manner as the encoder. The inter reference sample configuration unit 3908 may be provided separately from the inter-prediction unit 3906 and the intra-prediction unit 3905, or may be included in at least one of the inter-prediction unit and the intra-prediction unit. Also, the inter reference sample configuration unit 3908 may be combined with various application decoders, such as a scalable video decoder, a decoder capable of parallel processing, etc., and may be provided in a structure different from the structure shown in FIG. 39 depending on the application order.

FIG. 40 is a view illustrating another embodiment of a decoder that performs inter reference sample padding and/or inter reference sample filtering according to the present invention.

The decoder shown in FIG. 40 according to the present invention may include an entropy decoder 4001, an inverse quantization unit 4002, an inverse transform unit 4003, a filter unit 4004, an intra-prediction unit 4005, an inter-prediction unit 4006, a decoded picture buffer 4007, an inter reference sample configuration unit 4008, a transform picture generation unit 4009, and/or a transform picture buffer 4010.

The inter reference sample configuration unit 4008 receives the reference picture and/or the transform reference picture from the decoded picture buffer 4007 and/or the transform picture buffer 4010, and configures the reference sample required in prediction in the same manner as the encoder. The inter reference sample configuration unit 4008 may be provided separately from the inter-prediction unit 4006 and the intra-prediction unit 4005, or may be included in at least one of the inter-prediction unit and the intra-prediction unit. Also, the inter reference sample configuration unit 4008 may be combined with various application decoders, such as a scalable video decoder, a decoder capable of parallel processing, etc., and may be provided in a structure different from the structure shown in FIG. 40 depending on the application order.

The decoder according to the present invention may receive information on whether inter reference sample padding and/or filtering is applied and/or prediction information from the encoder.

FIG. 41 is a view illustrating an embodiment of a detailed configuration of the inter reference sample configuration unit according to the present invention.

The inter reference sample configuration unit according to the present invention may include an inter reference sample padding unit 4101 and/or an inter reference sample filter unit 4102.

Input of the inter reference sample configuration unit is the current picture, the reference picture, and/or the current block. The current block may be information on the position and/or size of the current block in the current picture. The reference picture may be a picture reconstructed before the current picture and/or a geometric transform picture.

Output of the inter reference sample configuration unit is the reference sample and/or prediction information. The reference sample is information on the reference pixel used in intra prediction. The prediction information may be information on whether inter reference sample padding is used, information on whether inter reference sample filtering is used, and/or information on the similar area.

The inter reference sample padding unit 4101 may determine the similar area in the reference picture, and may configure the reference sample by using the reference sample of the current block in the current picture and/or the pixel of the similar area in the reference picture. The inter reference sample filter unit 4102 may filter the reference sample by using the reference sample of the current block in the current picture and/or the pixel of the similar area in the reference picture.

In the embodiment explained with reference to FIG. 41, when the encoder and the decoder are able to determine whether inter reference sample padding and/or inter reference sample filtering is performed in a predetermined method, the relevant prediction information may not be signaled.

Hereinafter, generation of the geometric-transform reference picture will be described in detail. In the description, the reference picture may include or may be replaced with the geometric-transform reference picture. Also, in the description, the geometric-transform reference picture may be referred to as a geometric transform picture, a transform reference picture, a transform picture, a geometric transform video, a transform reference video, a transform video, etc. Also, in the description, the transform information may be geometric transform information.

FIGS. 42 to 52 are diagrams showing a method of generating a geometric modified image by geometrically modifying an image.

A geometric modification of an image may refer to geometrically modifying light information of the image. The light information may refer to luminance, color, or chrominance of each point of the image. Alternatively, the light information may refer to a pixel value in a digital image. The geometric modification may refer to a parallel movement of each point within an image, a rotation of an image, a size change of an image, etc.

FIGS. 42 to 46 are diagrams respectively showing a geometric modification of an image according to the present invention. (x, y) of each figure refers to a point of an original image before the modification. (x′, y′) refers to a point that corresponds to the point (x, y) after the modification. Herein, the corresponding point refers to a point to which light information of the point (x, y) is moved by the geometric modification.

FIG. 42 is a diagram showing a transfer modification of an embodiment of the geometric modification of the image according to the present invention.

In FIG. 42, tx refers to a displacement of each point that has transferred in an x-axis, and ty refers to a displacement of each point that has transferred in a y-axis. Therefore, a point (x′, y′) within the image after the modification is derived by adding tx and ty to a point (x, y) within the image before the modification. The transfer modification may be represented in a matrix shown in FIG. 42.
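Although the matrix itself appears only in FIG. 42, the transfer modification is conventionally written in homogeneous coordinates as follows (a standard form consistent with the description above):

$$
\begin{pmatrix} x' \\ y' \\ 1 \end{pmatrix}
=
\begin{pmatrix} 1 & 0 & t_x \\ 0 & 1 & t_y \\ 0 & 0 & 1 \end{pmatrix}
\begin{pmatrix} x \\ y \\ 1 \end{pmatrix},
\qquad
x' = x + t_x,\quad y' = y + t_y .
$$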

FIG. 43 is a diagram showing a size modification of an embodiment of a geometrical modification of an image according to the present invention.

In FIG. 43, sx refers to a size modification multiple in a direction of an x-axis, and sy refers to a size modification multiple in a direction of a y-axis. The size modification multiple may refer to a size ratio of an image before the modification to an image after the modification. When the size modification multiple is equal to 1, the size of the image before the modification is equal to the size of the image after the modification. When the size modification multiple is larger than 1, the size of the image is enlarged after the modification. When the size modification multiple is smaller than 1, the size of the image is reduced after the modification. The size modification multiple always has a value greater than 0. Therefore, a point (x′, y′) within the size-modified image after the modification is derived by multiplying the coordinates of a point (x, y) within the image before the modification by sx and sy, respectively. The size modification may be represented in a matrix shown in FIG. 43.
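In the same homogeneous form (the matrix of FIG. 43 is assumed to be the standard one), the size modification is:

$$
\begin{pmatrix} x' \\ y' \\ 1 \end{pmatrix}
=
\begin{pmatrix} s_x & 0 & 0 \\ 0 & s_y & 0 \\ 0 & 0 & 1 \end{pmatrix}
\begin{pmatrix} x \\ y \\ 1 \end{pmatrix},
\qquad
x' = s_x x,\quad y' = s_y y .
$$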

FIG. 44 is a diagram showing a rotation modification of an embodiment of a geometrical modification of an image according to the present invention.

In FIG. 44, θ refers to a rotation angle of an image. In the embodiment of FIG. 44, a rotation is performed centering around a point (0, 0) of an image before the modification. A point (x′, y′) within the rotation-modified image after the modification may be derived by using θ and a trigonometrical function. The rotation modification may be represented in a matrix shown in FIG. 44.
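Assuming the standard rotation about the origin (0, 0), the matrix of FIG. 44 takes the following form:

$$
\begin{pmatrix} x' \\ y' \\ 1 \end{pmatrix}
=
\begin{pmatrix} \cos\theta & -\sin\theta & 0 \\ \sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix}
\begin{pmatrix} x \\ y \\ 1 \end{pmatrix},
\qquad
x' = x\cos\theta - y\sin\theta,\quad y' = x\sin\theta + y\cos\theta .
$$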

FIG. 45 is a diagram showing an affine modification of an embodiment of a geometrical modification of an image according to the present invention.

The affine modification refers to the case where a transfer modification, a size modification, and a rotation modification are complexly performed. The geometric modification of the affine modification may vary according to the order in which the transfer modification, the size modification, and/or the rotation modification are applied to the image. Depending on the order of application and the combination of the modifications composing the affine modification, the image may also be modified in the form of tilting, in addition to the transfer modification, the size modification, and the rotation modification.

In FIG. 45, M_(i) may be a 3×3 matrix for a transfer modification, a size modification, or a rotation modification. According to the order of the modifications composing the affine modification, a single 3×3 matrix may be obtained by multiplying the matrices for the respective modifications by each other. In FIG. 45, matrix A may correspond to the 3×3 matrix obtained by multiplying matrices M₁ to M_(n). The matrix A may consist of elements a1 to a6. A matrix p is a point within an image before the modification, represented as a matrix. A matrix p′ is a point within the image after the modification that corresponds to the point p within the image before the modification. Therefore, the affine modification may be represented as the matrix equation p′=Ap.
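Written out, with the convention that later modifications multiply from the left (the text fixes only that A consists of a1 to a6, so the ordering shown here is an assumption), the composition and the affine mapping are:

$$
A = M_n \cdots M_2 M_1 =
\begin{pmatrix} a_1 & a_2 & a_3 \\ a_4 & a_5 & a_6 \\ 0 & 0 & 1 \end{pmatrix},
\qquad
p' = A\,p =
\begin{pmatrix} a_1 x + a_2 y + a_3 \\ a_4 x + a_5 y + a_6 \\ 1 \end{pmatrix}.
$$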

FIG. 46 is a diagram showing a projective modification of an embodiment of a geometrical modification of an image according to the present invention.

The projective modification may be an extended affine modification in which a perspective modification is added to the affine modification. When an object in a three-dimensional space is projected onto a two-dimensional plane, a perspective modification may occur according to a viewing angle of a camera or an observer. In the perspective modification, objects far away are represented as small, and objects near are represented as large.

In FIG. 46, a matrix H may be used for the projective modification. Elements h1 to h6 constituting the matrix H may correspond to the elements a1 to a6 constituting the matrix A for the affine modification of FIG. 45. Thereby, the projective modification may include the affine modification. Elements h7 and h8 constituting the matrix H may be elements related to the perspective modification.
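With h1 to h6 in the positions of a1 to a6 and h7, h8 carrying the perspective component, the projective matrix may be written as follows (h9 is a scale factor that can later be normalized to 1, as discussed with FIG. 48):

$$
H =
\begin{pmatrix} h_1 & h_2 & h_3 \\ h_4 & h_5 & h_6 \\ h_7 & h_8 & h_9 \end{pmatrix},
\qquad
p' \sim H\,p .
$$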

A geometric modification of an image is a method for geometrically modifying the image to a specific form. A point within an image after a geometric modification that corresponds to a point within the image before the geometric modification may be calculated by a geometric modification defined in a matrix. Conversely, a homography refers to a method of reversely deriving the mutual geometric modification matrix from two images having points corresponding to each other.

FIG. 47 is a diagram showing an example method of implementing a homography according to the present invention.

The homography may derive a geometric modification relation between two images based on an identification of points within the two images that correspond with each other. For this, feature point matching may be used. A feature point of an image refers to a point that has a descriptive feature within the image.

In steps S4701 and S4702, the homography implementing method may extract feature points from an original image and a geometric modified image. A feature point of an image may be extracted differently according to an extracting method, or according to a usage purpose. A point within an image at which a luminance value is dramatically changed, a center point of an area which has a specific shape, or a corner point of an object within the image may be used as the feature point. The feature point may be extracted by using algorithms such as scale-invariant feature transform (SIFT), speeded up robust features (SURF), blob detection, etc.

In step S4703, the homography implementing method may match the feature points extracted from the original image and the geometric modified image. In detail, since each of the extracted feature points is descriptive, the feature points between the two images may be matched by finding points having similar description information. The matched feature points may be used as points at which the original image and the geometric modified image correspond with each other.

However, the feature point matching may not match points that actually correspond with each other. Therefore, in step S4704, a valid feature point among the derived feature points may be selected. The method of selecting the valid feature point may vary according to the calculation algorithm. For example, methods such as a method of excluding a feature point that does not satisfy a baseline based on the description information, a method of excluding a feature point whose coherency is very low through a distribution of matched results, or a method of using a random sample consensus (RANSAC) algorithm may be used. The homography implementing method may selectively perform step S4704 according to the matching result of the feature points. In other words, step S4704 may not be performed according to circumstances. Alternatively, steps S4703 and S4704 may be merged. Alternatively, the homography implementing method may perform a matching process of valid feature points instead of performing steps S4703 and S4704.

In step S4705, the homography implementing method may derive a relational formula between the original image and the geometric modified image by using the selected valid points. In step S4706, the homography implementing method may derive a geometric modification matrix by using the derived formula. Alternatively, the homography implementing method may not perform step S4706, and may output the information of the formula derived in step S4705 in a form other than a geometric modification matrix.

FIG. 48 shows an example method of deriving a relational formula between corresponding points within two images according to the present invention.

A geometric modification of an image may be performed by a 3×3 matrix H. Therefore, simultaneous equations including the elements h1 to h9 of the matrix H as unknown quantities may be derived from the matrix formula p′=Hp. Herein, p means a point within an original image, and p′ means a point within a geometric modified image that corresponds to the point p. The equations may be simplified by dividing all elements of the matrix H by h9, thereby fixing h9 to 1. In this way, the number of unknown quantities may be reduced from 9 to 8.

Elements k1 to k8 of FIG. 48 correspond to the values in which h1 to h8 are divided by h9. A geometric modification matrix in which h9 is changed to 1 and h1 to h8 are changed to k1 to k8, respectively, performs an identical geometric modification. Therefore, 8 unknown values need to be calculated. In FIG. 48, the final formula for one pair of matched points may be expressed as two equations, one for x′ and one for y′. At least 4 pairs of matching points may be needed since there are 8 unknown values. However, the pairs of points may mismatch with each other as described above. Alternatively, the pairs of points may match with an error. Such an error may occur even when valid feature points are selected. Such an error may be reduced by using many pairs of matching points while calculating the geometric modification matrix. Therefore, the number of pairs of points to be used may be determined considering such features.
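Under this normalization, each matched pair (x, y) ↔ (x′, y′) yields the two equations below, so the eight unknowns k1 to k8 require at least four pairs:

$$
x' = \frac{k_1 x + k_2 y + k_3}{k_7 x + k_8 y + 1},
\qquad
y' = \frac{k_4 x + k_5 y + k_6}{k_7 x + k_8 y + 1}.
$$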

FIG. 49 is a diagram showing a method of generating a geometrically modified image based on a geometric modification matrix and an original image according to the present invention.

As shown in FIG. 49, generation of a geometric modified image may correspond to a generation of light information of a corresponding point within the geometric modified image by using light information of a point within an original image. (x0, y0), (x1, y1), and (x2, y2) of FIG. 49 refer to different points within the original image. In addition, (x′0, y′0), (x′1, y′1), and (x′2, y′2) are points within the geometric modified image that respectively correspond to (x0, y0), (x1, y1), and (x2, y2). A function f calculates a corresponding x′ coordinate of an x-axis within the geometric modified image by using the point (x, y) within the original image and additional information α that is used for the geometric modification. A function g calculates a corresponding y′ coordinate of a y-axis within the geometric modified image by using the point (x, y) within the original image and additional information β that is used for the geometric modification. When (x, y), (x′, y′), function f, and function g are expressed in a matrix formula, a matrix H may refer to the geometric modification method. Therefore, points that mutually correspond to each other within the original image and the geometric modified image may be found by using the matrix H.

The geometric modification method of FIG. 49 may be problematic for a discrete sample image signal, since light information is only included at points having an integer number coordinate of the discrete image signal. Therefore, when a point within the geometric modified image corresponding to a point within the original image has a real number coordinate, the light information is assigned to the point of the closest integer number coordinate within the geometric modified image. Thus, light information may be overlapped at some points having an integer number coordinate within the geometric modified image, or light information may not be assigned to them at all. In this case, inverse mapping may be used.

FIG. 50 is a diagram showing a method of generating a geometrically modified image by using inverse mapping according to the present invention.

A dotted rectangular area of FIG. 50 refers to an area that is actually observed. Points within the original image corresponding to each point within the dotted rectangular area may be derived. Therefore, light information of the original image may be assigned to all points within the geometric modified image. However, a point (x3, y3) corresponding to (x′3, y′3) may be placed outside of the original image. In this case, light information of the original image may not be assigned to the point (x′3, y′3). To points where light information of the original image is not assigned, neighboring light information of the original image may be assigned. In other words, light information of the closest point within the original image, for example, (x4, y4), may be assigned.

FIG. 51 is a diagram showing a method of generating a geometric modified image based on a geometric modification matrix and an original image according to the present invention, wherein the geometric modification matrix may correspond to geometric modification information.

In step S5101, the generation method may receive as input an original image, a geometric modification matrix, and/or information on the current point of a geometric modified image. The generation method may calculate a point of the original image corresponding to the current point of the geometric modified image by using the original image and the geometric modification matrix. The calculated corresponding point of the original image may be a real number corresponding point having a real number coordinate.

In step S5102, the generation method may determine whether or not the calculated corresponding point is placed inside the original image.

When the calculated corresponding point is not placed inside the original image in step S5102, the generation method may, in step S5103, replace the corresponding point with the point within the original image that is closest to the calculated corresponding point.

When the calculated corresponding point is placed inside the original image in step S5102, the generation method may perform step S5104. The generation method may also perform step S5104 after the corresponding point has been changed in step S5103.

In step S5104, when the corresponding point has a real number coordinate, the generation method may identify the closest points having an integer number coordinate. When the corresponding point has an integer number coordinate, the generation method may skip steps S5104 and S5105, and perform step S5106.

In step S5105, the generation method may generate light information of the point having a real number coordinate by interpolating the light information (for example, a pixel value) of the identified points having an integer number coordinate. As interpolation methods, Lanczos interpolation, S-Spline interpolation, and bicubic interpolation may be used.

In step S5106, the generation method may check whether or not the geometric modifications of all points within the geometric modified image are completed. Then, the generation method may finally output the generated geometric modified image.

When it is determined in step S5106 that the geometric modifications are not completed, the generation method may, in step S5107, change the current point of the geometric modified image to another point, and may repeat steps S5101 to S5106.
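A compact sketch of the inverse-mapping generation of FIG. 51 is shown below, combining the out-of-image handling of step S5103 with bilinear interpolation in step S5105. The matrix argument maps points of the geometric modified image back to the original image; whether that is the forward matrix or its inverse depends on the chosen convention, and boundary clamping is used here as an assumed realization of the "closest point" rule.

```python
import numpy as np

def generate_modified_image(original, back_map, out_h, out_w):
    # back_map: 3x3 matrix mapping a point of the geometric modified image
    # to its (real-number) corresponding point in the original image (S5101).
    h, w = original.shape
    out = np.zeros((out_h, out_w), dtype=original.dtype)
    for yp in range(out_h):                    # S5106/S5107: visit every point
        for xp in range(out_w):
            x, y, z = back_map @ (xp, yp, 1.0)
            x, y = x / z, y / z                # real-number corresponding point
            # S5102/S5103: if outside the original, use the closest inside point.
            x = min(max(x, 0.0), w - 1.0)
            y = min(max(y, 0.0), h - 1.0)
            i, j = int(x), int(y)              # closest integer coordinates (S5104)
            a, b = x - i, y - j
            i1, j1 = min(i + 1, w - 1), min(j + 1, h - 1)
            # S5105: bilinear interpolation of the four neighboring samples.
            out[yp, xp] = ((1 - a) * (1 - b) * original[j, i]
                           + a * (1 - b) * original[j, i1]
                           + (1 - a) * b * original[j1, i]
                           + a * b * original[j1, i1])
    return out
```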

FIG. 52 is a diagram showing bilinear interpolation among the various interpolation methods explained with reference to FIG. 51 according to an embodiment of the present invention.

In FIG. 52, a real number coordinate (x, y) may correspond to the real number corresponding point mentioned in step S5104 of FIG. 51. The 4 points (i, j), (i, j+1), (i+1, j), and (i+1, j+1) that are adjacent to the coordinate (x, y) may correspond to the closest points having an integer number coordinate that are mentioned in step S5104 of FIG. 51. I(x, y) may refer to light information of the point (x, y), such as luminance. a refers to an x-axis distance between i and x, and b refers to a y-axis distance between j and y. 1−a refers to an x-axis distance between i+1 and x, and 1−b refers to a y-axis distance between j+1 and y. Light information of the point (x, y) may be calculated from light information of the points (i, j), (i, j+1), (i+1, j), and (i+1, j+1) by using a ratio of a to 1−a in the x-axis, and a ratio of b to 1−b in the y-axis.
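With a = x − i and b = y − j, the ratios above give the familiar bilinear formula:

$$
I(x, y) = (1-a)(1-b)\,I(i, j) + a(1-b)\,I(i+1, j) + (1-a)\,b\,I(i, j+1) + a\,b\,I(i+1, j+1).
$$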

After generating a sample value of a point of the geometric modified image through the interpolation method, the same process is repeated for the other points of the geometric modified image. The generation of the geometric modified image is completed when sample values for all points of the geometric modified image are generated.

The above embodiments may be performed in the same manner in an encoder and a decoder.

The sequence of applying the above embodiments may be different between the encoder and the decoder, or may be the same in the encoder and the decoder.

The above embodiments may be performed on each of a luma signal and a chroma signal, or may be identically performed on the luma and chroma signals.

A block form to which the above embodiments of the present invention are applied may have a square form or a non-square form.

The above embodiments of the present invention may be applied depending on the size of at least one of a coding block, a prediction block, a transform block, a block, a current block, a coding unit, a prediction unit, a transform unit, a unit, and a current unit. Herein, the size may be defined as a minimum size and/or a maximum size at which the above embodiments are applied, or may be defined as a fixed size to which the above embodiments are applied. In addition, in the above embodiments, a first embodiment may be applied at a first size, and a second embodiment may be applied at a second size. In other words, the above embodiments may be applied in combination depending on the size. In addition, the above embodiments may be applied when the size is equal to or greater than a minimum size and equal to or smaller than a maximum size. In other words, the above embodiments may be applied when the block size is included within a certain range.

For example, the above embodiments may be applied when the size of the current block is 8×8 or greater. For example, the above embodiments may be applied when the size of the current block is 4×4 or greater. For example, the above embodiments may be applied when the size of the current block is 16×16 or greater. For example, the above embodiments may be applied when the size of the current block is equal to or greater than 16×16 and equal to or smaller than 64×64.

The above embodiments of the present invention may be applied depending on a temporal layer. In order to identify the temporal layer to which the above embodiments may be applied, a corresponding identifier may be signaled, and the above embodiments may be applied to the specific temporal layer identified by the corresponding identifier. Herein, the identifier may be defined as the lowest layer and/or the highest layer to which the above embodiments may be applied, or may be defined to indicate a specific layer to which the embodiments are applied. In addition, a fixed temporal layer to which the embodiments are applied may be defined.

For example, the above embodiments may be applied when the temporal layer of a current image is the lowest layer. For example, the above embodiments may be applied when the temporal layer identifier of a current image is 1. For example, the above embodiments may be applied when the temporal layer of a current image is the highest layer.

A slice type to which the above embodiments of the present invention are applied may be defined, and the above embodiments may be applied depending on the corresponding slice type.

In the above-described embodiments, the methods are described based on flowcharts with a series of steps or units, but the present invention is not limited to the order of the steps; rather, some steps may be performed simultaneously with, or in a different order from, other steps. In addition, it should be appreciated by one of ordinary skill in the art that the steps in the flowcharts do not exclude each other and that other steps may be added to the flowcharts or some of the steps may be deleted from the flowcharts without influencing the scope of the present invention.

The embodiments include various aspects of examples. All possible combinations of the various aspects may not be described, but those skilled in the art will be able to recognize different combinations. Accordingly, the present invention may include all replacements, modifications, and changes within the scope of the claims.

The embodiments of the present invention may be implemented in the form of program instructions, which are executable by various computer components, and recorded in a computer-readable recording medium. The computer-readable recording medium may include stand-alone or a combination of program instructions, data files, data structures, etc. The program instructions recorded in the computer-readable recording medium may be specially designed and constructed for the present invention, or well known to a person of ordinary skill in the computer software technology field. Examples of the computer-readable recording medium include magnetic recording media such as hard disks, floppy disks, and magnetic tapes; optical data storage media such as CD-ROMs or DVD-ROMs; magneto-optical media such as floptical disks; and hardware devices, such as read-only memory (ROM), random-access memory (RAM), flash memory, etc., which are particularly structured to store and implement the program instructions. Examples of the program instructions include not only machine language code produced by a compiler but also high-level language code that may be executed by a computer using an interpreter. The hardware devices may be configured to operate as one or more software modules, or vice versa, to conduct the processes according to the present invention.

Although the present invention has been described in terms of specific items such as detailed elements as well as the limited embodiments and the drawings, they are only provided to help a more general understanding of the invention, and the present invention is not limited to the above embodiments. It will be appreciated by those skilled in the art to which the present invention pertains that various modifications and changes may be made from the above description.

Therefore, the spirit of the present invention shall not be limited to the above-described embodiments, and the entire scope of the appended claims and their equivalents will fall within the scope and spirit of the invention.

INDUSTRIAL APPLICABILITY

The present invention may be used in encoding/decoding an image.

The invention claimed is:
1. An image encoding method performed by an image encoding apparatus, the method comprising: generating a geometric transform matrix based on geometric transform information, the geometric transform information comprising at least one of transfer modification information, size modification information and rotation modification information; determining a prediction mode of a current block; determining a reference block of the current block from a reference picture based on reference points modified by the geometric transform matrix; and predicting the current block by referring to the reference block, wherein the generating the geometric transform matrix comprises: when the geometric transform information comprises the transfer modification information, determining movement factors for the geometric transform matrix based on the transfer modification information, and when the geometric transform information comprises the size modification information and the rotation modification information, determining zooming and rotation factors for the geometric transform matrix based on the size modification information and the rotation modification information.
2. An image encoding apparatus comprising an intra-prediction unit configured to: generate a geometric transform matrix based on geometric transform information, the geometric transform information comprising at least one of transfer modification information, size modification information and rotation modification information; determine a prediction mode of a current block; determine a reference block of the current block from a reference picture based on reference points modified by the geometric transform matrix; and predict the current block by referring to the reference block, wherein the generating the geometric transform matrix comprises: when the geometric transform information comprises the transfer modification information, determining movement factors for the geometric transform matrix based on the transfer modification information, and when the geometric transform information comprises the size modification information and the rotation modification information, determining zooming and rotation factors for the geometric transform matrix based on the size modification information and the rotation modification information.
3. An image decoding method performed by an image decoding apparatus, the method comprising: generating a geometric transform matrix based on geometric transform information, the geometric transform information comprising at least one of transfer modification information, size modification information and rotation modification information; decoding a prediction mode of a current block; determining a reference block of the current block from a reference picture based on reference points modified by the geometric transform matrix; and predicting the current block by referring to the reference block, wherein the generating the geometric transform matrix comprises: when the geometric transform information comprises the transfer modification information, determining movement factors for the geometric transform matrix based on the transfer modification information, and when the geometric transform information comprises the size modification information and the rotation modification information, determining zooming and rotation factors for the geometric transform matrix based on the size modification information and the rotation modification information.
4. An image decoding apparatus comprising an intra-prediction unit configured to: generate a geometric transform matrix based on geometric transform information, the geometric transform information comprising at least one of transfer modification information, size modification information and rotation modification information; decode a prediction mode of a current block; determine a reference block of the current block from a reference picture based on reference points modified by the geometric transform matrix; and predict the current block by referring to the reference block, wherein the generating the geometric transform matrix comprises: when the geometric transform information comprises the transfer modification information, determining movement factors for the geometric transform matrix based on the transfer modification information, and when the geometric transform information comprises the size modification information and the rotation modification information, determining zooming and rotation factors for the geometric transform matrix based on the size modification information and the rotation modification information.
5. A non-transitory readable medium storing a bitstream formed by an image encoding method, the image encoding method including: generating a geometric transform matrix based on geometric transform information, the geometric transform information comprising at least one of transfer modification information, size modification information and rotation modification information; determining a prediction mode of a current block; determining a reference block of the current block from a reference picture based on reference points modified by the geometric transform matrix; and predicting the current block by referring to the reference block, wherein the generating the geometric transform matrix comprises: when the geometric transform information comprises the transfer modification information, determining movement factors for the geometric transform matrix based on the transfer modification information, and when the geometric transform information comprises the size modification information and the rotation modification information, determining zooming and rotation factors for the geometric transform matrix based on the size modification information and the rotation modification information.